GenCore version 5,1.6 
Copyright (c) 1993 - 2004 Compugen Ltd, 



OM nucleic - nucleic search, using sw model 



Run on : 



Title: 

Perfect score: 
Sequence : 



August 15, 2004, 04:17:29 ; Search time 2848,01 Seconds 

(without alignments) 
9405,294 Million cell updates/sec 

US-09-864-675-3 
897 

1 atgaggcgcgacccggcccc caatggtcaacttctcctaa 897 



Scoring table: 



IDENTITY_NUC 

Gapop 10,0 , Gapext 1.0 



Searched: 



27513289 seqs, 14931090276 residues 



Total number of hits satisfying chosen parameters; 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



55026578 



Database 



EST: 



1 
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4 
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7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 



em_estba : * 
em_esthum: * 
em_estin : * 
em_estmu: * 
em_estov: * 
em_estpl : * 
em_estro : * 
em_htc : * 
gb_estl : * 
gb_est2 : * 
gb_htc: * 
gb_est3: * 
gb_est4 : * 
gb_est5 : * 
em_estf un : * 
em_estom: * 
em_gss_hum: * 
em_gss_inv: * 
em_gss_pln: * 
em_gss_vrt : * 
em_gss_fun: * 
em_gss_mam: * 
em_gss_mus : * 
em_gss_pro : * 
em_gss_rod: * 
em_gss_phg: ^ 
em_gss vrl:* 



28: gb_gssl:* 
29: gb_gss2:* 

Pred, No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed^, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 





No, 


Score 


Match 


Length 


DB 


ID 


Description 




1 


674 


75. 


1 


805 


12 


BI918620 


HI 9 18 620 603176570 




2 


565,6 


63. 


1 


1047 


12 


BM914622 


BM914 622 AGF.NroIIRT 


c 


3 


467.2 


52. 


1 


1041 


12 


BI412864 


BI412864 602988202 




4 


467 


52, 


1 


524 


13 


BX281777 


RX281777 Ry?ft1777 




5 


408 . 2 


45, 


5 


549 


9 


AA706226 


AA706226 ahPftan7 ^ 




6 


396. 6 


44. 


2 


412 


9 


AI041451 


AI041451 ow36c02,s 




7 


363. 8 


40, 


6 


427 


10 


BF108794 


BF108794 7]S?an^ v 




8 


318 . 6 


35. 


5 


488 


13 


BX529505 


BX529505 BX529505 


c 


9 


255 


28. 


4 


949 


12 


BI410828 


BI410828 602963734 




10 


234.4 


26. 


1 


333 


10 


BE983573 


BE983573 UI-M-CGOn 




11 


224 


25. 


0 


297 


9 


AA772412 


AA772412 ai44el2 s 




12 


210.2 


23. 


4 


795 


12 


BI651936 


BI651936 603298677 




13 


195.2 


21, 


8 


259 


10 


BE648780 


BE648780 UI-M-BH2 , 




14 


195.2 


21, 


8 


327 


9 


AA968077 


AA968077 uh09h01 r 


c 


15 


182 . 6 


20. 


4 


362 


13 


BX089049 


BX0 8 904 9 BX0 8 904 9 




16 


169, 8 


18. 


9 


529 


10 


AW476657 


AW476657 uq79e01.y 


c 


17 


154. 8 


17 . 


3 


539 


9 


AL918370 


7VL918370 AL918370 




18 


142 


15. 


8 


657 


13 


BQ078813 


BQ078813 fy81c06, y 




19 


111 


12, 


4 


256 


10 


AW762061 


AW762061 ur53c01.y 




20 


107. 6 


12, 


0 


458 


9 


AI152190 


AI152190 udl8hlO,r 




21 


105.2 


11. 


7 


750 


28 


BZ847665 


BZ847665 CH240 239 




22 


101 


11. 


3 


493 


28 


BH057870 


BH057 870 RPCI-24-9 


c 


23 


100,4 


11. 


2 


481 


28 


AZ987593 


AZ987593 2M0270P10 


c 


24 


97.4 


10. 


9 


180 


29 


CE103297 


CE103297 tigr-gss- 




25 


90.2 


10, 


1 


243 


10 


BB570162 


BB570162 BB570162 


c 


26 


85 


9. 


5 


512 


9 


AI073386 


AI073386 ool3d06,x 




27 


81.2 


9. 


1 


167 


9 


AI836531 


AI836531 UI-M-APO- 




28 


76.6 


8 . 


5 


477 


10 


BE984041 


BE984041 UI-M-CGOp 




29 


70.8 


7. 


9 


761 


13 


BX876483 


BX876483 BX876483 




30 


67.2 


7. 


5 


765 


12 


BI522417 


BI522417 603175321 




31 


67 


7. 


5 


769 


12 


BI413085 


BI413085 602990205 




32 


66 


7. 


4 


751 


29 


CNS04J6G 


7VL293137 Tetraodon 




33 


64, 8 


7. 


2 


321 


10 


BE983721 


BE983721 UI-M-CGOp 




34 


64,4 


7. 


2 


538 


9 


AL925790 


AL925790 AL925790 


c 


35 


61 


6. 


8 


491 


9 


AL909688 


AL909688 AL909688 


c 


36 


60.2 


6. 


7 


322 


9 


AL909689 


AL909689 AL909689 




37 


59 


6. 


6 


356 


29 


CG614660 


CG614660 OST303096 




38 


58.8 


6. 


6 


363 


29 


CE326275 


CE326275 tigr-gss- 




39 


58.8 


6. 


6 


413 


14 


N62228 


N62228 yz63c08,sl 




40 


58.4 


6. 


5 


685 


14 


CA351220 


CA351220 622234 NC 




41 


53 


5. 


9 


1630 


11 


AK051824 


AK051824 Mus muscu 




42 


51. 6 


5. 


8 


925 


29 


CNS0091P 


AL053013 Drosophil 


c 


43 


51. 6 


5, 


8 


982 


13 


BX415111 


BX415111 BX415111 




44 


50. 8 


5. 


7 


647 


12 


BI960178 


BI960178 HVSMEn002 




45 


50,4 


5. 


6 


251 


9 


AW045376 


AW045376 UI-M-BHl- 



ALIGNMENTS 



RESULT 1 
BI918620 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



ORIGIN 



BI918620 805 bp mRNA linear EST 16-OCT-2001 

603176570F1 NIH_MGC_121 Homo sapiens cDNA clone IMAGE : 524 0969 5^, 
mRNA sequence. 
BI918620 

BI918620.1 GI:16182295 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 805) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 
Plate: LLAM11607 row: k column: 18 
High quality sequence start: 2 
High quality sequence stop: 77 8. 

Location/ Qualifiers 

1. .805 

/organism="Homo sapiens" 

/ mo l_t y p e = "mRNA" 

/db_xref="taxon: 9606" 

/clone="IMAGE: 5240969" 

/lab_host="DHlOB" 

/ clone_lib="NIH_MGC_12 1 " 

/note="Organ: brain; Vector: pCMV-SP0RT6; Site_l: NotI; 
Site_2 : EcoRV (destroyed); RNA source anonymous pool of 3 
fetal brains, female age 20 weeks, female age 24 weeks, 
and male age 26 weeks. Library is oligo-dT primed and 
directionally cloned (EcoRV site is destroyed upon 
cloning). Average insert size 1.7 kb, insert size range 
0.7-3.5 kb . Library is normalized and enriched for 
full-length clones and was constructed by C. Gruber 
(Invitrogen) . Research Genetics tracking code 017. Note: 
this is a NIH_MGC Library." 



Query Match 75.1%; Score 674; DB 12; Length 8 05; 

Best Local Similarity 98.7%; Pred. No. 2.8e-127; 

Matches 732; Conservative 0; Mismatches 5; Indels 5; Gaps 



5; 



Qy 1 ATGAGGCGCGACCCGGCCCCCGGC-TTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTG 59 



Db 


64 


ATGAGGCGCGACCCGGCCCCCGGCGTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTG 


123 


Qy 


60 


CTACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGA 


119 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


124 


CTACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTAC7VAGGCACCCGTGGTGGTGGA 


183 


Qy 


120 


GGGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCC 


179 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 




Db 


184 


GGGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCC 


243 


Qy 


18 0 


GCCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGG 


239 






1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 




Db 


244 


GCCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGG 


303 


Qy 


240 


GGGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCA 


299 






1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 M 




Db 


304 


GGGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCA 


363 


Qy 


300 


GCGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTG- 


358 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 




Db 


364 


GCGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGT 


423 


Qy 


359 


CCCCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTG 


418 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


424 


CCCCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTG 


483 


Qy 


419 


ACTGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGA 


478 






1 1 1 1 1 1 1 1 M I 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 




Db 


484 


ACTGCGCCACCCGGCCCAAGTTGAAGAAGATGACGAGCCAGACGGGACAGGTGGGTGAGA 


543 


Qy 


479 


AGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCA 


538 






1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


544 


AGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCA 


603 


Qy 


539 


AGGATGGCAAGGAGCTCAACCG-CAGCCGAGACATTCGCATCAAATATGGCAACGGCAGA 


597 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 M 1 1 1 1 1 1 1 1 1 




Db 


604 


AGGATGGCAAGGAGCTCAACCGTCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGA 


663 


Qy 


598 


AAGAACT C AC GACT AC AGTT CAAC AAGGT GAAGGT GGAGGAC GOT GGGGA- GT AT GT CT G 


656 






1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 M 1 1 II II 




Db 


664 


AAGAACTCACGACTACAGTTCAACAAGGTGT^GGTGGAGGACGCTGGGGAGGTATGTCTG 


723 


Qy 


657 


CGAGG-CCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCG 


715 






1 1 1 1 1 1 1 1 II 1 1 1 II M 1 M 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M 




Db 


724 


CGAGGCCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGTTTTACGTCAACAGGT 


783 


Qy 


716 


TGAGCACCACCCTGTCATCCTG 737 




Db 


784 


1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 
TGAGCACCAACCTGTCATCCTG 8 05 





RESULT 2 
BM914622 

LOCUS BM914622 1047 bp mRNA linear EST 12-MAR-2002 

DEFINITION AGENCOURT_6615334 NIH_MGC_113 Homo sapiens cDNA clone IMAGE : 54 80308 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



ORIGIN 



5', mRNA sequence. 
BM914622 

BM914 622.1 GI: 19365001 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1047) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collect! on (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Mark Watson 
cDNA Library Preparation: Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consortium/LLNL at: 

http : //image . llnl . gov 

Plate: LLCM2002 row: p column: 05 

High quality sequence stop: 541. 
Location/Qualifiers 
1. .1047 

/organism="Homo sapiens" 
/mol_type= "mRNA" 
/db_xref="taxon: 9606" 
/clone="IMAGE: 5480308" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_113" 

/note="Organ: spleen; Vector: pOTB7; Site_l: Xhol; Site_2: 
EcoRI; cDNA made by oligo-dT priming. Directionally cloned 
into EcoRI/XhoI sites using the following 5' adaptor: 
GGCACGAG(G) . Library constructed by Ling Hong in the 
laboratory of Gerald M. Rubin (University of California, 
Berkeley) using ZAP-cDNA synthesis kit (Stratagene) and 
Superscript II RT (Life Technologies) . Note: this is a 
NIH_MGC Library. " 



Query Match 63.1%; Score 565.6; DB 12; Length 1047; 

Best Local Similarity 96.8%; Pred. No. 4.4e-105; 

Matches 577; Conservative 0; Mismatches 19; Indels 0; 



Gaps 



0; 



Qy 



Db 



272 GCTCCTGTGTGCCGCTCGAAAGGAACCAGCGCTACATCTTTTTCCTGGAGCCCACGGAAC 331 
I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M 
1 GCTCCTGTGTGCCGCTCGAAAGGAACCAGCGCTACATCTTTTTCCTGGAGCCCACGGAAC 60 



Qy 

Db 

Qy 

Db 



332 



61 



392 



AGCCCTTAGTCTTTAAGACGGCCTTTGCCCCCCTCGATACCAACGGCAAAAATCTCAAGA 

I M I I I I I M I I I I I I M I I I M I M I I I I I I I I I M I M M I I I I I I I I I I I I I I I I I I 

AGCCCTTAGTCTTTAAGACGGCCTTTGCCCCCCTCGATACCAACGGCAAAAATCTCAAGA 



AAGAGGTGGGCTVAGATCCTGTGCACTGACTGCGCCACCCGGCCCAAGTTGAAGAAGATGA 
I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I 
121 AAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCCGGCCCAAGTTGAAGAAGATGA 



391 



120 



451 



180 



Qy 452 AGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTA 511 

M I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I 
Db 181 AGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTA 24 0 

Qy 512 ATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGAGACA 571 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 ATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGAGACA 300 

Qy 572 TTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTGAAGG 631 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 TTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTGAAGG 360 

Qy 632 TGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCC 691 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M 
Db 361 TGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCC 42 0 

Qy 692 GGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCC 751 

I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I 
Db 421 GGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCC 480 

Qy 752 GGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTACATCG 811 

I I I I I M I M I I I I I I I I I I I I I M I I M I I I M I I I I I I M I I I I I I I I I I I I I I I I I 

Db 481 GGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATGGAGGCGCCTGCTACTACATCG 54 0 

Qy 812 AGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGT 867 

I I I I I I I I I I I I I I I I I I I I I I I I M I mil II I III 

Db 541 AGGCCATCAATCAGCTTTCCTGCAAATGTCCCAATGGATTCTTCCGACCAACATGT 596 



RESULT 3 

BI412864/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BI412864 1041 bp mRNA linear EST 14-AUG-2001 

602988202F1 NCI_CGAP_Lu33 Mus musculus cDNA clone IMAGE : 5144 016 5', 
mRNA sequence. 
BI412864 

BI 412 8 64 . 1 01:15173787 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 1041) 
NIH-MGC http : //mgc. nci , nih. gov/ . 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r(amail . nih . gov 

Tissue Procurement: Gilbert Smith, Ph.D. 

cDNA Library Preparation: M. Bento Scares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by:Incyte Genomics, Inc. 

Clone distribution: NCI-CGAP clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 
Plate: LLAM11355 row: d column: 01 
High quality sequence start: 11 



High quality sequence stop: 645. 
FEATURES Location/Qualifiers 
source 1. .1041 

/ organism="Mus musculus " 
/mol_type="mRNA" 
/strain="Czech II" 
/db_xref="taxon: 10090" 
/clone="IMAGE: 5144 016" 
/tissue_type="pooled lung tumors" 
/lab_host="DH10B (phage-resistant ) " 
/ clone_lib= "NCI_CGAP_Lu3 3 " 

/note="Organ: lung; Vector: pT7T3D-Pac (Pharmacia) with a 
modified polylinker; Site_l: NotI; Site_2: EcoRI; 1st 
strand cDNA was prepared from mRNA obtained from pooled 
lung tumors with a Not I — oligo(dT) primer [5* 
TGTTACCAATCTGAAGTGGGAGCGGCCGCCTCTGTTTTTTTTTTTTTTTTT 3 ' ] . 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not 
I and Eco RI sites of the modified pT7T3 vector. Library 
went through one round of normalization, and was 
constructed by Bento Soares and M. Fatima Bonaldo. " 

ORIGIN 

Query Match 52.1%; Score 467.2; DB 12; Length 1041; 

Best Local Similarity 86,6%; Pred. No. 5.7e-85; 

Matches 563; Conservative 0; Mismatches 78; Indels 9; Gaps 4; 

Qy 165 CAGCACCCGAGAGCCGCCCGCCTCGGGTCGGGT GGCGTTGGTAAAGGTGCTGGACA 220 

I I I I II II I I I I I I I I I II II III I I I I I M I II I I I I I I I I 

Db 656 CACCTCGAGATGCCCGCCCGCCTCGGGTTCGGTTGGCGTCTTGGTGAAAGGTGCTGGACA 597 

Qy 221 AGTGGCCG — CTCCGGAGCGGGGGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTG 278 

I I I I II I I I I I I I I I I I II I I I M I II I II M I I I I I I II I I I I I I I I II I I I I 

Db 596 AGTTGCCGGCTCCCGGATCGGGGGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTG 537 

Qy 279 TGTGCCGCTCGAAAGGAACCAGCGCTACATCTT-TTTCCTGGAGCCCACGGi^ACAGCCCT 337 

II I M I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I M I I II I II I I II 

Db 536 TGCGCCGCTCGAAAGGAACCAGCGCTACATCTTGTTTCCTGGAGCCCACCGAGCAGCCCT 477 

Qy 338 TAGTCTTTAAGACGGCCTTTGCCCCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGG 397 

I I M I I I I I I I I II I I I I MM I I II II I I II I M I I M I I II I I 

Db 476 TAGTTTTT7UVGACAGCCTTTTGCCCCGGTCGACCCTACGGCAAATACATCAAGAAAGAGG 417 

Qy 398 TGGGCAAGATCCTGTGCACTGACTGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCC 4 57 

II II II II M I I I I I M I I I I I I I M II I I I I I I I I II I I I M I I I I M II I II I M I I 

Db 416 TGGGCAAGATCCTGTGCACTGACTGCGCCACCCGGCCCAAGCTGAAGAAGATGAAGAGCC 357 

Qy 4 58 AGACGGGACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCC 517 

MM Ml II I I I I M I M I I I II I I I II II I I II I I I II II I M II M MM 

Db 356 AGACAGGAGAGGTGGGTGAGAAGCAGTCGCTCAAGTGTGAGGCAGCGGCGGGAAACCCCC 2 97 

Qy 518 AGCCTTCCTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCA 577 

MM Mill II I I I I II I I I I M M I I I I I I I I II I I M II M II M I M I I 

Db 296 AGCCCTCCTATCGCTGGTTCAAGGATGGCAAGGAACTCAACCGGAGTCGTGATATTCGCA 237 



Qy 



578 TCAAATATGGC7\ACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGG 637 

MM II M I II I I I I I M I II I II I I II I M I I I M I M II I I II II I I I I I I M 



Db 



236 TCAAGTATGGCAATGGCAGAAAGAACTCACGGCTACAGTTCAACAAAGTGAGGGTGGAGG 177 



Qy 

Db 

Qy 

Db 

Qy 

Db 



638 ACGCTGGGGAGTATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCC 697 

I M I I I I I I I I I I I I I I I I I i I M I I I I I I I M I I I I M I I I I I I M I I I I I I 
17 6 ATGCCGGGGAGTACGTCTGTGAGGCCGAGAACATCCTTGGGAAGGACACCGTGAGGGGCC 117 



698 



758 



GCAACGAGACAGCCAAGTCCTA — TTGCGTCAATGGAGGCGTCTGCTACT 
I I I I Mill I I II II I I M I II II I I I I I I I II II I I I I I I I 
56 GCAATGAGACCGCCAAGTCCTACCATGTGTGAATGGAGGCGTGTGCTACT 7 



805 



757 



GGCTTTACGTCAACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGT 
I M I I I I I I I I I I II II I I I I I I I I I I I I I II I I I I I I I I I II II I I I I I I I I 
116 GACTCCATGTCAACAGCGTGAGCACCACTCTGTCATCCTGGTCGGGACATGCCCGGAAGT 57 



RESULT 4 
BX281777 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BX281777 524 bp mRNA linear EST 04-MAR-2003 

BX281777 NIH_MGC_121 Homo sapiens cDNA clone IMAGp998K1811607 ; 
IMAGE: 5240969, mRNA sequence. 
BX281777 

BX281777. 1 GI:28612804 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 524) 

Ebert,L., Heil,0., Hennig.S., Neubert,P., Partsch,E., Peters, M., 
Radelof,U., Schneider, D. and Korn,B. 
Human UnigeneSet - RZPD3 
Unpublished (2003) 
Contact: Ina Rolfs 

RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany 
RZPD; IMAGp998K1811607. 

RZPDLIB; I.M.A.G.E. cDNA Clone Collection; 
Human UnigeneSet - RZPD3 (RZPDLIB No. 972) 
http : //www. rzpd.de/CloneCards/cgi- 
bin/showLib.pl.cgi/response?libNo=972 Contact: Ina Rolfs 
RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Heubnerweg 6, D-14 059 Berlin, Germany 
Tel: +49 30 32639 101 
Fax: +49 30 32639 111 
www. rzpd.de 

This clone is available royalty-free from RZPD; 

contact RZPD {clone@rzpd.de) for further information. Seq primer: 
M13u, Primer sequence: CGTTGTAAAACGACGGCCAGT . 

Location/Qualifiers 

1. .524 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 

/clone="IMAGp998Kl811607 ; IMAGE: 5240969" 

/lab_host="DH10B" 

/clone_lib="NIH_MGC_121" 

/note="Organ: brain; Vector: pCMV-SP0RT6; Site_l: NotI; 



Site_2: EcoRV (destroyed); RNA source anonymous pool of 3 
fetal brains, female age 20 weeks, female age 24 weeks, 
and male age 26 weeks. Library is oligo-dT primed and 
directionally cloned (EcoRV site is destroyed upon 
cloning). Average insert size 1.7 kb, insert size range 
0.7-3.5 kb. Library is normalized and enriched for 
full-length clones and was constructed by C. Gruber 
(Invitrogen) . Research Genetics tracking code 017. Note: 
this is a NIH_MGC Library." 

ORIGIN 

Query Match 52.1%; Score 4 67; DB 13; Length 524; 

Best Local Similarity 100.0%; Pred. No. 4.7e-85; 

Matches 4 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I 

58 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 117 
Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I M 

Db 118 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACT^GGCACCCGTGGTGGTGGAG 177 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

I I I I I M M I I I I I I I I I I M I I I I I I I I I I I M I I I I M I I M I I I I I I I I I I I 

Db 178 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 237 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

M I I I I I I I I I I I I I I I I I I I I I I I M M M I I I I M I I I I I I I I I I I I I I I I I I I I I I I 

Db 238 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 297 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I M I I I I I M I I I I I I I I I I M I 

2 98 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 357 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 358 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 417 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

I M I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I M I I I I I I I I 

418 CCCCTCGATACCAACGGCAAA7VATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 47 7 
Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACA 467 

M I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 478 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACA 524 



RESULT 5 
AA706226 
LOCUS 

DEFINITION 



ACCESSION 

VERSION 

KEYWORDS 



AA706226 549 bp mRNA linear EST 12-JAN-1999 

ah28a07.sl Soares_parathyroid_tumor_NbHPA Homo sapiens cDNA clone 
1240116 3* similar to TR:P43328 P43328 NEU DIFFERENTIATION FACTOR 
NDF04 ;, mRNA sequence. 
AA706226 

AA7 0 6226. 1 GI: 2716144 
EST. 



SOURCE 

0RG7\NISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 549) 

NCI-CGAP http : / /www. ncbi . nlm. nih . gov/ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 
Tumor Gene Index 
Unpublished (1997) 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r(5mail , nih . gov 

cDNA Library Preparation: M. Bento Scares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 
cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing Center 
Clone distribution: NCI-CGAP clone distribution information can be 

found through the I.M.A.G.E. Consortium/LLNL at: 

www-bio. llnl . gov/bbrp/image/ image . html 

Possible reversed clone: similarity on wrong strand 
Possible reversed clone: polyT not found 
Insert Length: 689 Std Error: 0.00 
Seq primer: -40ml3 fwd. ET from Amersham 
High quality sequence stop: 451. 

Location/Qualifiers 

1. .549 

/organism="Homo sapiens" 
/mol_t ype= "mRNA" 
/db_xref="taxon: 9606" 
/clone="1240116" 

/ tissue_type="parathyroid tumor" 
/ de v_s tage= " adult " 

/lab_host="DH10B (ampicillin resistant)" 
/ clone_lib="Soares_parathyroid_tumor_NbHPA" 
/note="Organ: parathyroid gland; Vector: pT7T3D 
(Pharmacia) with a modified polylinker; Site_l: Not I; 
Site_2: Eco RI; 1st strand cDNA was primed with a Not I - 
oligo(dT) primer 

[ 5 ' -TGTTACCAATCTGAAGTGGGAGCGGCCGCACCAATTTTTTTTTTTTTTTTTTTT 
TTTTT-3M, double-stranded cDNA was size selected, ligated 
to Eco RI adapters (Pharmacia), digested with Not I and 
cloned into the Not I and Eco RI sites of a modified pT7T3 
vector (Pharmacia). Library went through one round of 
normalization to a Cot = 5. Library constructed by Bento 
Soares and M. Fatima Bonaldo. RNA from sporadic parathyroid 
adenomas was kindly provided by Dr. Stephen Marx, National 
Institute of Diabetes and Digestive and Kidney Diseases, 
NIH. " 



ORIGIN 



Query Match 45.5%; Score 408,2; DB 9; Length 549; 

Best Local Similarity 91.7%; Pred. No. 5e-73; 

Matches 431; Conservative 0; Mismatches 39; Indels 0; Gaps 0; 

424 GCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 4 83 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1^ GCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 74 



Qy 


484 


TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 

1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 ( 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 L . . 

1 > ' ' 1 M M 1 M M M M M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 I M 1 1 1 I I I I I I I I M 1 1 1 I M 

TCGCTGAAGTGTGAGGCAGCAGCGGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 


543 


Db 


75 


134 


Qy 


544 


GGCAAGGAGCTCTyVCCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAAC 

1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 t 1 t 1 1 1 1 1 1 1 i 1 1 1 I 1 1 r t 1 1 I 1 1 1 1 1 1 1 1 . . 

' I 1 M M M M 1 M M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 { 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I I { 1 1 1 1 1 
GGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAAC 


603 


Db 


135 


194 


Qy 


604 


TCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

> 1 1 1 M M M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 

TCACGACTACAGTTCAAC7VAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCC 


663 


Db 


195 


254 


Qy 


664 


GAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACC 
1 1 1 1 1 I 1 1 1 t 1 1 1 1 1 t 1 1 1 t 1 1 1 1 I 1 1 ) 1 1 1 1 t 1 1 1 1 1 I 1 ... . 

' 1 > 1 M M M M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M M III 

GAGAACATCCTGGGGAAGGACACCGTCGGAGGCCGGCTTTACGTCAACAGCGTGACGACC 


723 


Db 


255 


314 


Qy 


724 


ACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGC 

1 I 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 M M 1 II 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 

ACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGNGACAGCCAAGTCCTATTGC 


783 


Db 


315 


374 


Qy 


784 


GTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCT 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 I M II 1 1 1 1 1 1 1 M 1 1 1 II M 1 1 1 1 1 1 1 1 M 1 III 

GTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGGCACCT 


843 


Db 


375 


434 


Qy 


844 


GTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTC 893 
II III M II 1 1 1 1 1 1 1 1 M 1 
GGGCTGCACTGCTTAGAACTTGGTACCCAGAGCCACCACTTCCCCATCTC 48 4 




Db 


435 





RESULT 6 
AI041451 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



AI041451 412 bp mRNA linear EST 28-AUG-1998 

ow36c02.sl Soares_parathyroid_tumor_NbHPA Homo sapiens cDNA clone 
IMAGE:1648898 3' similar to TR:014511 014511 NTAK. mRNA 
sequence . 
AI041451 

AI041451,1 GI:3280645 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 412) 

NCI-CGAP http : / /www. ncbi . nlm. nih, gov/ncicgap . 

National Cancer Institute^ Cancer Genome Anatomy Project (CGAP) , 
Tumor Gene Index 
Unpublished (1997) 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail . nih . gov 

cDNA Library Preparation: M. Bento Scares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 
cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing Center 
Clone distribution: NCI-CGAP clone distribution information can be 

found through the I.M.A.G.E. Consortium/LLNL at: 

www-bio . llnl . gov/bbrp/image/image . html 



Trace considered overall poor quality 
Insert Length: 671 Std Error: 0,00 
Seq primer: -4 0ml3 fwd. ET from Amersham 
High quality sequence stop: 1. 
FEATURES Location/Qualifiers 
source 1. ,412 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/clone=" IMAGE: 1648898" 

/tissue_type="parathyroid tumor" 

/dev_stage= "adult" 

/lab_host="DH10B (ampicillin resistant)" 
/clone_lib="Soares__parathyroid tumor NbHPA" 
/note="Organ: parathyroid gland; Vector: pT7T3D 
(Pharmacia) with a modified polylinker; Site_l: Not I; 
Site_2: Eco RI ; 1st strand cDNA was primed with a Not I - 
oligo(dT) primer 

[5 ' -TGTTACCAATCTGAAGTGGGAGCGGCCGCACCAATTTTTTTTTTTTTTTTTTTT 
TTTTT-3'], double-stranded cDNA was size selected, ligated 
to Eco RI adapters (Pharmacia), digested with Not I and 
cloned into the Not I and Eco RI sites of a modified pT7T3 
vector (Pharmacia) . Library went through one round of 
normalization to a Cot = 5. Library constructed by Bento 
Soares and M.Fatima Bonaldo. RNA from sporadic parathyroid 
adenomas was kindly provided by Dr. Stephen Marx, National 
Institute of Diabetes and Digestive and Kidney Diseases, 
NIH. " 

ORIGIN 



Query Match 44.2%; Score 396.6; DB9; Length 412; 

Best Local Similarity 97.6%; Pred. No, le-70; 

Matches 402; Conservative 0; Mismatches 10; Indels 0; Gaps 



Qy 


426 


CACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATC 

1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 M M 1 1 1 1 

CACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATC 


485 


Db 


1 


60 


Qy 


486 


GCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGG 

N 1 1 M 1 1 1 1 1 M M 1 1 1 1 1 1 IIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIII 

GCTGAAGTGTGAGGCAGCAGCGATAAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGG 


545 


Db 


61 


120 


Qy 


546 


CAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAACTC 

1 I 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 M 1 1 1 1 M 1 M 1 1 1 1 1 M 1 M 1 M M 

CAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAACTC 


605 


Db 


121 


180 


Qy 


606 


ACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGA 

N M 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

ACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGA 


665 


Db 


181 


240 


Qy 


666 


GAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCAC 

III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III II 1 1 1 1 1 1 1 1 1 1 1 1 1 M I 1 1 M 

GAACATCCTGGGGAAGGACACCGTACGAGGCCGGCTTTACGTCAACAGCGTGACGACCAC 


725 


Db 


241 


300 


Qy 


726 


CCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGT 

1 1 1 1 1 1 1 1 1 N 1 1 1 1 1 M M 1 1 1 M 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 


785 



301 CCTGTCATCCTGGTCGGGGCACGCCGGGAAGTGCAACGNGACAGCCAAGTCCTATTGCGT 360 



Qy 



Db 



78 6 CAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAG 837 

I I I I I I I M M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 

361 CAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAG 412 



RESULT 7 
BF108794 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



ORIGIN 



BF108794 427 bp mRNA linear EST 20-OCT-2000 

7152g03,xl Soares_NSF_F8_9W_OT_PA_P_Sl Homo sapiens cDNA clone 
IMAGE: 3525292 3* similar to SW : NTAK_HUMAN 014511 NTAK PROTEIN 
; contains MSRl.tl MSRl repetitive element mRNA sequence 
BF108794 

BF108794 .1 GI:10938484 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 427) 

NCI-CGAP http : //www. ncbi . nlm. nih . gov/ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 
Tumor Gene Index 
Unpublished (1997) 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail . nih . gov 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (inf o@image . llnl , gov) for further information. 
Seq primer: -4 0UP from Gibco 
High quality sequence stop: 396. 

Location/Qualifiers 

1. .427 

/organism="Homo sapiens" 
/mol_type=:"mRNA" 
/db_xref="taxon: 9606" 
/clone="IMAGE : 3525292 " 
/lab_host="DH10B" 

/clone_lib="Soares_NSF_F8_9W_OT PA P SI" 

/note="Organ: pooled; Vector? pT7T3D~Pac (Pharmacia) with 
a modified polylinker; Site_l: Not I; Site_2: Eco RI; 
Equal amounts of plasmid DNA from five normalized 
libraries were mixed, and ss circles were made in vitro. 
Following HAP purification, this DNA was used as tracer in 
a subtractive hybridization reaction. The driver was 
PCR-amplified cDNAs from pools of 5,000 clones made from 
the same 5 libraries. The pools consisted of the following 
libraries and clonelDs: Soares NbHSF pool 1: 
309384-310919, 323208-325895 Soares Nb2HP pool 1: 
145032-147335, 147720-148103, 148872-149255, 15002 - 
150407, 151176-152327 Soares Nb2HF8-9W pool 1: 
758280-760583, 772104-774407 Soares NbHPA pool 1: 
304776-306311, 320136-322823, 326280-326663 Soares NbHOT 
pool 1: 723720-726407, 73908 0-74 0999 Subtraction by Bento 
Soares and M. Fatima Bonaldo." 



Query Match 40.6%; Score 363.8; DBIO; Length 427; 

Best Local Similarity 91.3%; Pred. No. 5.4e-64; 

Matches 386; Conservative 0; Mismatches 37; Indels 0; Gaps 0; 

CCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 422 

I I III I I 11 I I I I I I I I II 

CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 6 4 

Qy 423 CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 4 82 

I I I I I I I I I I N 11 I I M I I I I I I I I II I II I II M M I I I I 

DJD 65 AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 124 

Qy 483 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 542 

I I I I I I I I I I I I I I I N I 11 I 11 I 11 M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 184 



Qy 363 
Db 5 



Db 125 
Qy 543 
Db 185 



TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 602 

I I < I I I I I I I I I I I I I I M I I I I I I I I I I I M I M I I I I I I I I I I I I I I II I I I I I I I M 

TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 



244 



Qy 603 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 662 

I I ' I I I I I I I I I I I I I I I I I 11 IN I I I II 11 I I! I I I I I I I M M I I M I I 

Db 245 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 304 

Qy 663 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 722 

I I I I 11 I I I I II II M I I M I I I I I I I II I I I I I II I I I I I I I I I I I I I 

CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 364 



Db 305 



Qy 723 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATT' 



Db 365 



G V 8 2 

I ' I ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 iTi 1 1 im 1 1 

CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 424 



Qy 783 CGT 785 

I I I 

Db 425 CGT 42 7 



RESULT 8 
BX529505 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
COMMENT 



BX529505 488 bp mRNA linear EST 27-JUN-2003 

BX529505 NCI_CGAP_Mam3 Mus musculus cDNA clone IMAGp998N017639 • 
IMAGE: 3153984, mRNA sequence. 
BX529505 

BX529505.1 GI:32297863 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 
1 (bases 1 to 488) 
Heil.O,, Ebert,L., Neubert.P., 
and Korn^B. 

Mouse UnigeneSet - RZPD2 
Unpublished (2003) 
Contact: Ina Rolfs 

RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany 



Peters, M., Radelof,U., Schneider, D. 



FEATURES 

source 



ORIGIN 



RZPD; IMAGp998N017639. 

RZPDLIB; I.M.A.G.E. cDNA Clone Collection; 
Mouse UnigeneSet - RZPD2 (RZPDLIB No. 981) 
http : //www. rzpd.de/CloneCards/cgi- 

bin/showLib.pl.cgi/response?libNo=981 Contact: Ina Rolfs 
RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Heubnerweg 6, D-14059 Berlin, Germany 
Tel: +49 30 32639 101 
Fax: +49 30 32639 111 
www. rzpd.de 

This clone is available royalty-free from RZPD; 

contact RZPD (clone@rzpd.de) for further information. Seq primer: 
SP6, Primer sequence: ATTTAGGTGACACTATAG . 

Location/ Qualifiers 

1. ,488 

/organism="Mus mus cuius" 
/mol_type="mRNA" 
/strain="129,C57BL/6J, FVB/N" 
/db_xref="taxon: 10090" 

/clone="IMAGp998N017639 ; IMAGE: 3153984" 
/tissue_type="tumor, gross tissue" 
/dev_stage="10 months" 
/lab_host="DH10B" 
/clone_lib="NCI_CGAP_Mam3 " 

/note="Organ: mammary; Vector: pCMV-SP0RT6; Site_l: Sail; 
Site_2: NotI; Cloned unidirectionally . Primer: Oligo dT^ 
Library constructed by Life Technologies. Investigators 
providing samples: Lothar Hennighausen/Chu-Xia Deng, NIH 
Reference for transgenic model: Xu et al . , Nature Genetics 
22, 37-43 (1999) ." 



Query Match 35.5%; Score 318.6; DB13; Length 488; 

Best Local Similarity 86.7%; Pred. No. 9.7e-55; 

Matches 351; Conservative 0; Mismatches 54; Indels 0; Gaps 0; 



Qy 


469 


GTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTAC 

N 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 II II M 1 M 1 1 1 1 1 1 1 1 1 I 

GTGGGTGAGAAGCAGTCGCTCAAGTGTGAGGCAGCGGCGGGAAACCCCCAGCCCTCCTAT 


528 


Db 


1 


60 


Qy 


529 


CGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGC 

1 IIIIMIIIII IIIIMM II II II lllllliiiii iiMII 

CGCTGGTTCAAGGATGGCAAGGAACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGC 


588 


Db 


61 


120 


Qy 


589 


AACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAG 

II N 1 II II 1 1 M II M M 1 M II 1 1 1 1 1 1 M 1 II MM 

AATGGCAGAAAGAACTCACGGCTACAGTTCAACAAAGTGAGGGTGGAGGATGCCGGGGAG 


648 


Db 


121 


180 


Qy 


649 


TATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTC 

1 1 1 1 1 N II II II 1 M II 1 II 1 1 1 1 II 1 1 1 II 1 II II 1 M 1 1 1 1 1 1 

TACGTCTGTGAGGCCGAGAACATCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTC 


708 


Db 


181 


240 


Qy 


709 


AACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACA 

1 1 1 1 N 1 II II II M II II II 1 M II 1 II 1 II M M 1 II 1 M 1 1 II 1 II 1 Mill 

AACAGCGTGAGCACCACTCTGTCATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACC 


768 


Db 


241 


300 


Qy 


769 


GCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTC 


828 



ill!!!lilli^!L' ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

.GCTC 360 



301 GCCAAGTCCTACTGTGTGAATGGAGGCGTGTGCTACTACATCGAGGGCATCAACCAi 



829 TCCTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 
N I I I I I I I I I I I I I I I I III I I I I I I III 

TCCTGC7\AATGTCCAAACGGATTCTTCGGACAGAGATGTTTGGAG 4 05 



RESULT 9 

BI410828/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BI410828 949 bp mRNA linear EST 14-AUG-2001 

602963734F1 NCI_CGAP_Lu33 Mus musculus cDNA clone IMAGE : 5119065 5* 
mRNA sequence, ' 
BI410828 

BI410828.1 01:15171751 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 
1 (bases 1 to 949) 
NIH-MGC http://mgc.nci, nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D, 

Email : cgapbs-rtimail . nih . gov 

Tissue Procurement: Gilbert Smith, Ph.D, 

cDNA Library Preparation: M. Bento Scares, Ph.D., M. Fatima 
Bonaldo, Ph,D. 

cDNA Library Arrayed by: The I,M.A.G.E, Consortium (LLNL) 
DNA Sequencing by:Incyte Genomics, Inc. 

Clone distribution: NCI-CGAP clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : / / image . llnl , gov 



10 



Plate: LLAM11290 row: d column: 
High quality sequence start: 28 
High quality sequence stop: 919. 

Location/Qualifiers 

1. .949 

/organism="Mus musculus" 
/mol_t ype= "mRNA" 
/strain="C2ech II" 
/db__xref="taxon: 10090" 
/clone="IMAGE: 5119065" 
/tissue_type="pooled lung tumors" 
/lab_host="DH10B (phage-resistant ) " 
/ clone_lib= "NCI_CGAP_Lu33 " 

/note="Organ: lung; Vector: pT7T3D-Pac (Pharmacia) with a 
modified polylinker; Site_l: NotI; Site_2: EcoRI; 1st 
strand cDNA was prepared from mRNA obtained from pooled 
lung tumors with a Not I - oligo(dT) primer [5* 
TGTTACCAATCTGAAGTGGGAGCGGCCGCCTCTGTTTTTTTTTTTTTTTTT 3 » ] . 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not 
I and Eco RI sites of the modified pT7T3 vector. Library 
went through one round of normalization, and was 
constructed by Bento Soares and M. Fatima Bonaldo. " 



ORIGIN 



Query Match 28.4%; Score 255; DB12; Length 949; 

Best Local Similarity 80.0%; Pred. No. 1.3e-41; 

Matches 388; Conservative 0; Mismatches 85; Indels 12; Gaps 7; 
Qy 397 GTGGGCAAGATCCTGTGCACTGACTGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGC 456 

,,111111 I II mil MM IMMMN MMI MMMMMMMI I 

Db 947 GTGGGCCAGATCCTGGGCACTG-CTGCGCCACCCGCCCCAA-CTGAAGAAGATGAAGA-C 891 

Qy 457 CAGACGGGACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCC 516 

,,1111 II MM M M I M M M M M M I I 

Db 8 90 CAAACCAGAAGAGTCGGTGAGAACAGTTCGCTCAAGTGTGAGGCACGGCCGGGGAAACCC 831 
Qy 517 CAGCCTTCCTACC GTTGGTTCAAGGATGGC-AAGGAGCTCAACCGCAGCCGAGAC 570 

I I I I I I I I I I M M M M M I M M I M M M M 

Db 830 CCCCACCCCTCCCTATCGCTGGTTTCAAGGATGGCAAAGGAACTCAACCGGAGTCGTGAT 771 

Qy 571 ATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTGAAG 630 

I I I I I I I I I N INI Ml M M M I I I M M II II II M II M II M I I || 

ATTCGCATCAAGTATGCCAATGGCAGAAAGAACTCACGGCTACAGTTCAACAAAAGTGAG 



Db 770 



711 



Qy 631 GT— GGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACCG 688 

II I I N N I I I N I I I I II M I I I I II M I M M II I I I I I Ml I I II I 

GTTGGAGGATTGCCGGGGAGTACGTCTGTGAGGCCGAGAACATCCTTGGGAAGGACACCG 651 



Db 710 



Qy 689 TCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCACG 74 8 

ii,lL_l_i!l I I I I I I I I I I I I I M M M II I M I I II I II II I II I 

ATG 592 



Db 650 T-GAGGGCCGACTCCATGTCAACAGCGTGAGCACCACTCTGTCATCCTGGTCGGGAC. 



Qy 749 
Db 591 



CCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTACA 8 08 

I I I I I I I I I I I N Mill III M II M I I I II II II I I II I M I M 

CCCGGAAGTGCT^ATGAGACCGCCAAGTCCTACTGTGTGAATGGAGGCGTGTGCTACTACA 532 
Qy 8 09 TCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGTC 868 

I I I I N I I M II I M M I II I I II I II I I II M Mill III I II M I 

TCGAGGGCATCAACCAGCTCTCCTGCAAATGTCCAAACGGATTCTTCGGACAGAGATGTT 472 



Db 531 



Qy 869 AGCAG 873 

I I I 

Db 4 71 TGGAG 4 67 



RESULT 10 

BE983573 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



BE983573 333 fop ^^j^ linear EST 29-APR-2002 

UI-M-CGOp-bgi-c-07-O-UI.sl NIH_BMAP_Ret4_S2 Mus musculus cDNA clone 
UI-M-CGOp-bgi-c-07-O-UI 3\ mRNA sequence. 
BE983573 

BE983573.1 GI:10654893 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 333) 

Bonaldo^M.F. , Lennon.G. and Scares, M.B. 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



two approaches to facilitate gene 



FEATURES 

source 



MD 



ORIGIN 



Normalization and subtraction: 
discovery 

Genome Res. 6 (9), 791-806 (1996) 
97044477 
8889548 

Contact: Chin, H 

National Institute of Mental Health 

6001 Executive Blvd. Room 7N-7190, MSG 9643, Bethesda, 
20892-9643, USA 
Tel: 301 443 1706 
Fax: 301 443 9890 
Email: mEST@mail.nih.gov 

Oligo-dT track not found. Not I site shown in beginning of sequence 
is likely internal to the message. cDNA Library Preparation: M.B. 
Soares Lab Clone distribution: Researchers may obtain BMAP cDNA 
clones from RESEARCH GENETICS. It should be noted that Bento Soares 
IS generating a small number of additional specialized 
non-redundant arrays of BMAP cDNAs whose availability will be 
considered under appropriate and limited collaborative arrangements 
The tissue for this library was contributed by Dr. Xin-Yuan Fu, 
Yale University School of Medicine The following repetitive 
elements were found in this cDNA sequence: 15-105, 
>GC_rich#Low_complexity 
Seq primer: M13 Forward 
POLYA=No . 

Location/Qualifiers 
1. .333 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/db_xref="taxon: 10090" 

/clone="UI-M-CG0p-bgi-c-07-0-UI" 

/lab_host="DH10B (Life Technologies)" 

/ clone_lib="NIH_BMAP_Ret4_S2" 

/note="Vector: pT7T3D-Pac~ (Pharmacia) with a modified 
polylinker; Site_l: Not I; Site_2 : Eco RI; The 
NIH_BMAP_Ret4_S2 library is a subtracted library, 
ultimately derived from mouse retina tissue libraries at 
various stages of development. For a detailed description 
of the library from which this clone was derived, please 
visit our web site at brainest . eng , uiowa . edu . The tissue 
for this library was contributed by Dr. Xin-Yuan Fu, Yale 
University School of Medicine 
TAG_SEQ=None found" 



Query Match 26. 1%; 

Best Local Similarity 95.6%; 
Matches 241; Conservative 



Qy 

Db 

Qy 

Db 



Score 234.4; DB 10; 
Pred. No. 1.3e-37; 
0; Mismatches 11; 



Length 333; 
Indels 0; 



Gaps 



82 



0; 



ATGAGGCGCGACCCGGCCCGCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

ATGAGGCGCGACCCGGCCCCCGGCTTCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 141 



61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I I I I I I I I I I I I I I I I I I I I III M II II I I I I I I I I I I M I I I I II II I I I 

142 TACTCGCCCAGCCTCAAGTCGGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 201 



Qy 


121 


GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III II 1 1 1 1 1 III..... .... 

'''''''' 1 1 1 1 1 1 1 1 1 1 III II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II II 1 1 1 1 1 |. 1 1 1 1 1 
GGCAAGGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCG 


180 


Db 


202 


261 


Qy 


181 


CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCT(^r;Ar AAr;Tr;f::rr'rrTrrrrnr'nr-r-^ 


240 


Db 


262 


' 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 


321 


Qy 


241 


GGGCTGCAGCGC 252 
M 1 1 1 1 1 1 1 1 1 1 
GGGCTGCAGCGC 333 




Db 


322 





RESULT 11 

AA772412 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



^^^^^'^^ 297 bp mRNA linear EST 31-DEC-1998 

ai44el2.sl Soares_parathyroid_tuinor_NbHPA Homo sapiens cDNA clone 
1359886 3» similar to TR:P43328 P43328 NEU DIFFERENTIATION FACTOR 
NDF04 mRNA sequence. 
AA772412 

AA772412, 1 01:2824195 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 297) 

NCI-CGAP http : / /www . ncbi . nlm, nih . gov/ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP) 
Tumor Gene Index ' 
Unpublished ( 1997 ) 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r @mail . nih . gov 

cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 
CDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing Center 
Clone distribution: NCI-CGAP clone distribution information can be 

found through the I.M.A.G.E. Consortium/LLNL at: 

www-bio . llnl . gov/bbrp/image/image . html 

Possible reversed clone: similarity on wrong strand 
Possible reversed clone: polyT not found 
Insert Length: 667 Std Error: 0.00 
Seq primer: -40ml3 fwd. ET from Amersham 
High quality sequence stop: 267. 

Location/Qualifiers 

1. .297 

/organism="Homo sapiens" 
/mol_t ype= "mRNA" 
/db_xref="taxon: 9606" 
/clone="1359886" 

/tissue_type="parathyroid tumor" 
/dev_stage="adult" 

/lab_host="DH10B (ampicillin resistant)" 
/ clone_lib="Soares_parathyroid_tumor_NbHPA" 



/note="Organ: parathyroid gland; Vector: pT7T3D 
(Pharmacia) with a modified polylinker; Site_l: Not I; 
Site_2: Eco RI; 1st strand cDNA was primed with a Not I - 
oligo(dT) primer 

[ 5 ' -TGTTACCAATCTGAAGTGGGAGCGGCCGCACCAATTTTTTTTTTTTTTTTTTTT 
TTTTT-3'], double-stranded cDNA was size selected, ligated 
to Eco RI adapters (Pharmacia), digested with Not I and 
cloned into the Not I and Eco RI sites of a modified pT7T3 
vector (Pharmacia) . Library went through one round of 
normalization to a Cot = 5. Library constructed by Bento 
Scares and M.Fatima Bonaldo, RNA from sporadic parathyroid 
adenomas was kindly provided by Dr. Stephen Marx, National 
Institute of Diabetes and Digestive and Kidney Diseases 
NIH. " 

ORIGIN 



Query Match 25.0%; Score 224; DB 9; Length 297; 

Best Local Similarity 92.2%; Pred. No, l,7e-35; 

Matches 236; Conservative 0; Mismatches 20; Indels 0; Gaps 0 
Qy 4 05 GATCCTGTGCACTGACTGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGG 4 64 

, I I I I I III I II I I I M I I II M I I I I I II I I I I I M I 

Db 42 GAGCCCGATCCCGGGAGAAAGCCACCCGGCCAAGTTGAAGAAGATGAAGAGCCAGACGGG 101 

Qy 4 65 ACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTC 524 

I I I I N I I M I I I I I I I II M I I I I I II I I I I I I I M I I I I I I I I I II I I I I I I I I I 
Db 102 ACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCGGTGAATCCCCAGCCTTC 161 

^25 CTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATA 584 

N M I I I II I I I I I I I M I I I I I I I I I I I II I I I M I M I I I I I I I I I I I I I I I I I I I M 

CTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATA 221 
Qy 585 TGGCAACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGG 644 

IIIIIIMIIIIIIIIIIMIIIIIIMlliiiilll M I I M I I I I M I I I I I 

222 TGGCAACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGG 281 

Qy 645 GGAGTATGTCTGCGAG 660 

M M I I I I I II I I I I I 
Db 282 GGAGTATGTCTGCGAG 297 



RESULT 12 

BI651936 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



BI651936 795 bp mRNA linear EST 12-SEP-2001 

603298677F1 NCI_CGAP_Mam3 Mus musculus cDNA clone IMAGE : 5339251 5*, 
mRNA sequence. 
BI651936 

BI651936,1 GI:15566172 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 795) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 



COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Lothar Hennighausen Ph.D., Chu-Xia Deng Ph.D. 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 
Plate: LLAM11861 row: j column: 20 
High quality sequence stop: 795. 
FEATURES Location/Qualifiers 
source 1. ,795 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="129, C57BL/6J, FVB/N" 

/db_xref="taxon: 10090" 

/clone="IMAGE: 5339251" 

/tissue_type="tumor, gross tissue" 

/dev_stage="10 months" 

/lab_host="DH10B" 

/ cl one_lib= "NCI_CGAP_Mam3 " 

/note="Organ: mammary; Vector: pCMV-SPORT6; Site_l: Sail; 
Site_2: NotI; Cloned unidirectionally . Primer: Oligo dT. 
Library constructed by Life Technologies. Investigators 
providing samples: Lothar Hennighausen/Chu-Xia Deng, NIH 
Reference for transgenic model: Xu et al . , Nature Genetics 
22, 37-43 (1999) . " 

ORIGIN 

Query Match 23.4%; Score 210.2; DB 12; Length 795; 

Best Local Similarity 86.2%; Pred. No. 1.7e-32; 

Matches 244; Conservative 0; Mismatches 38; Indels 1; Gaps 1; 
Qy 592 GGCAGAAAGAACTCACGAC-TACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTA 650 

I I I I I I I M I M I I M I I IIIMIMIIIII MM MMMMI M MMMM 

GGCAGAAAGAACTCACGGCTTACAGTTCAACAAAGTGAGGGTGGAGGATGCCGGGGAGTA 6 0 



Db 1 



TGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAA 710 

'Mil M M M M I M M M M M M M M I M I M M M M I M I M M I 

CGTCTGTGAGGCCGAGAACATCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTCAA 12 0 



Db 61 



Qy 711 CAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGC 770 

M I I M M I M M I I I M M M M M M M M M M M I M M M M I M M I M 

CAGCGTGAGCACCACTCTGTCATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGC 18 0 



Db 121 



Qy 771 CAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTC 830 

I I M M M I M M M M I M M M M M M I M M M M M M M M M I 

CAAGTCCTACTGTGTGAATGGAGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTC 240 



Db 181 



Qy 831 CTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

I I I M I I I I I I Ml I M II I III 

241 CTGCAAATGTCCAAACGGATTCTTCGGACAGAGATGTTTGGAG 2 83 



RESULT 13 
BE648780 



LOCUS BE648780 259 bp mRNA linear EST 06-SEP-2000 

DEFINITION UI-M-BH2.2-aop-b-12-0-UI.rl NIH__BMAP_M_S3 , 2 Mus musculus cDNA clone 

UI-M-BH2.2-aop-b-12-0-UI 5', mRNA sequence. 
ACCESSION BE64 8 78 0 
VERSION BE648780.1 GI:9974601 

KEYWORDS EST. 

SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 259) 

AUTHORS Bonaldo,M.F. , Lennon^G. and Scares, M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 

discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
MEDLINE 97044477 
PUBMED 8889548 
COMMENT Contact: Chin, H 

National Institute of Mental Health 

6001 Executive Blvd. Room 7N-7190, MSC 9643, Bethesda, MD 

20892-9643, USA 

Tel: 301 443 1706 

Fax: 301 443 9890 

Email : mEST@mail . nih . gov 

cDNA Library Preparation: M.B. Scares Lab Clone distribution: 
Researchers may obtain BMAP cDNA clones from RESEARCH GENETICS. It 
should be noted that Bento Scares is generating a small number of 
additional specialized non-redundant arrays of BMAP cDNAs whose 
availability will be considered under appropriate and limited 
collaborative arrangements 
Seq primer: M13 Reverse. 
FEATURES Location/Qualifiers 
source 1. .259 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/ db_x r e f = " t axon : 1 0 0 9 0 " 

/clone="UI-M-BH2.2-aop-b-12-0-UI" 

/dev_stage="27-32 days" 

/lab_host="DH10B (Life Technologies)" 

/clone_lib="NIH_BMAP_M_S3 . 2 " 

/note="Vector : pT7T3D-Pac (Pharmacia) with a modified 
polylinker; Site__l: Not I; Site_2: Eco RI ; The 
NIH_BMAP_M_S3.2 library is a subtracted library of a 
series, ultimately derived from a mixture of individually 
tagged normalized libraries from ten regions of the mouse 
brain (cerebellum, brain stems, olfactory bulbs, 
hypothalamus, cortex, amygdala, basal ganglia, pineal 
gland, striatum, hipoccampus) after a series of 
subtractions to reduce the representation of cDNAs from 
which ESTs had already been generated. The following 
serially subtracted libraries were generated in this 
process: NIH_BMAP_M_S3 . 2, NIH_BMAP_M_S2 , NIH_BMAP_M_S1 . 
The subtracted library (NIH_BMAP_M_S3 . 2 ) was constructed 
as follows: PCRamplified cDNA inserts from NIH_BMAP_M_S2 
clones from which 3' ESTs had been derived was used as a 
driver in a hybridization with the NIH_BMAP_M_S2 library 



in the form of single-stranded circles. The remaining 
single-stranded circles (subtracted library) was purified 
by hydroxyapatite column chromatography, converted to 
double-stranded circles and electroporated into DHIOB 
bacteria (Lif eTechnologies ) to generate the 
NIH_BMAP__M_S3. 2 library. This procedure has been 
previously described (Bonaldo, Lennon and Scares, Genome 
Research 6:791-806, 1996)" 



ORIGIN 



Query Match 21.8%; Score 195.2; DB 10; Length 259; 

Best Local Similarity 90.1%; Pred. No. 1.2e-29; 

Matches 2 09; Conservative 0; Mismatches 23; Indels 0; Gaps 0; 
Qy 663 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 722 

I I M I I I I I I I I I M I I M M M I I I I I I I I I I I I I I I I I M M I I I I I 

Db 1 CCACAACATCCTTGGGAAAGACACCGTGAGGGGCCGACTCCATGTCAAGAGCGTGAGCAC 60 

Qy 723 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 7 82 

III I II I I I I II I M I M I I II I I I I I I I I I II M I I I I I I I I I M I I I I I I II 
Db 61 CACTCTGTCATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGCCAAGTCCTACTG 12 0 

Qy 783 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 842 

II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 121 TGTGAATGGGGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 180 

Qy 843 TGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCC 8 94 

I I I I I I I I I I I I I I I MM I M I II I I I I I I M M M I I M I M I 11 I I I 

Db 181 TGTGGGATACACCGGTGACAAGTGTCAGCAGTTCGCAATGGTCAACTTCTCC 232 



RESULT 14 

AA968077 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



AA968077 327 bp mRNA linear EST 19-MAY-1998 

uh09h01.rl Scares mouse hypothalamus NMHy Mus musculus cDNA clone 
IMAGE:1617457 5' similar to TR:O35073 035073 NTAK ALPHA2-1P ;, mRNA 
sequence . 
AA968077 

AA968077.1 GI:3141970 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus, 
1 (bases 1 to 327) 

Marra,M., Hillier,L., Allen, M., Bowles, M., Dietrich, N., Dubuque, T., 
Geisel,S., Kucaba,T., Lacy,M. , Le,M., Martin, J., Morris, M., 
Schellenberg,K. , Steptoe,M., Tan,F., Underwood, K. , Moore, B., 
Theising,B., Wylie,T., Lennon, G., Scares, B., Wilson, R. and 
Waterston, R. 

The WashU-HHMI Mouse EST Project 
Unpublished (1996) 

Contact: Marra M/Mouse EST Project 

WashU-HHMI Mouse EST Project 

Washington University School of MedicineP 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel: 314 286 1800 



Fax: 314 286 1810 

Email: mouseest@watson , wustl . edu 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI: 956757 

Seq primer: -2 8ml3 rev2 ET from Amersham 
High quality sequence stop: 318. 
FEATURES Location/Qualifiers 
source 1. .327 

/organism="Mus musculus" 

/mol_type="mRNA" 

/db_xref="taxon: 10090" 

/clone="IMAGE: 1617457" 

/tissue_type="hypothalamus" 

/lab_host="DH10B" 

/clone_lib="Soares mouse hypothalamus NMHy" 

/note="Organ: brain; Vector: pT7T3D-Pac (Pharmacia) with a 
modified polylinker; Site_l : Not I; Site_2: Eco RI; 1st 
strand cDNA was primed with a Not I - oligo (dT) primer [5* 
TGTTACCAATCTGAAGTGGGAGCGGCCGCCAAGGTTTTTTTTTTTTTTTTTTTTTTTT 
T 3*]; double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. RNA 
provided by Dr. Wolfgang Liedtke. Library went through 
two rounds of normalization, and was constructed by Bento 
Scares and M.Fatima Bonaldo." 

ORIGIN 

Query Match 21.8%; Score 195.2; DB 9; Length 327; 

Best Local Similarity 90.2%; Pred. No. 1.4e-29; 

Matches 231; Conservative 0; Mismatches 23; Indels 2; Gaps 2; 

Qy 641 CTGGGGAGTATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGC 700 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I III II I 
2 CCGGGGAGTACGTCTGTGAGGCCCAGAACATCCTTGGGAAGGACACCGTGAGGGA-CGAC 60 

Qy 701 TTTACGTCAACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCA 760 

I I II I I I I I I II I II I I II I I I I M I II I I I I II II II II I I I I I I I I I I I I I 
Db 61 TCCATGTCAACAGCGTGAG-ACCACTCTGTCATCCTGGTCGGGACATGCCCGGAAGTGCA 119 

Qy 761 ACGAGACAGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCA 82 0 

I I I I I I I I I I I I M II I II II I I I I I I M I I I II I I I I I I I I I I I II I I I I I M 
Db 120 ATGAGACCGCCAAGTCCTACTGTGTGAATGGAGGCGTGTGCTACTACATCGAGGGCATCA 17 9 

Qy 821 ACCAGCTCTCCTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAA 880 

I M I I I I I I I M I I M I I I I M I I I I I II I I I I II I I I II I I I I I I I I I I I I I I M I M I 

Db 180 ACCAGCTCTCCTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAA 23 9 

Qy 881 TGGTCAACTTCTCCTA 8 96 

I I I I I I I I I I I I I I 
Db 240 TGGTCAACATCTCCAA 255 



RESULT 15 
BX089049/C 

LOCUS BX089049 362 bp mRNA linear EST 23-JAN-2003 

DEFINITION BX089049 Soares_parathyroid_tumor_NbHPA Homo sapiens cDNA clone 



IMAGp998M133119 ; IMAGE : 124 0116 , mRNA sequence. 
ACCESSION BX089049 

VERSION BX089049.1 GI:27825909 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 362) 

AUTHORS Ebert,L., Heil,0., Hennig,S,, Neubert,P., Partsch,E., Peters, M., 

Radelof,U., Schneider, D. and Korn,B. 
TITLE Human UnigeneSet - RZPD3 

JOURNAL Unpublished (2003) 
COMMENT Contact: Ina Rolfs 

RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany 
RZPD; ■lMAGp998M133119. 

RZPDLIB; I.M.A.G.E. cDNA Clone Collection; 
Human UnigeneSet - RZPD3 (RZPDLIB No. 972) 
http : / /www. rzpd.de/CloneCards/cgi- 

bin/showLib.pl.cgi/response?libNo=972 Contact: Ina Rolfs 
RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Heubnerweg 6, D-14059 Berlin, Germany 
Tel: +49 30 32639 101 
Fax: +49 30 32639 111 
www, rzpd.de 

This clone is available royalty-free from RZPD; 

contact RZPD (clone@rzpd.de) for further information. Seq primer: 
M13r, Primer sequence: TTTCACACAGGAAACAGCTATGAC . 
FEATURES Location/Qualifiers 
source 1. .362 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 

/clone="IMAGp998M133119 ; IMAGE : 1240116" 
/ tissue_type="parathyroid tumor" 
/dev_stage= "adult" 

/lab_host="DH10B (ampicillin resistant)" 
/ clone_lib="Soares_parathyroid_tumor_NbHPA" 
/note="Organ: parathyroid gland; Vector: pT7T3D 
(Pharmacia) with a modified polylinker; Site_l: Not I; 
Site_2: Eco RI; 1st strand cDNA was primed with a Not I - 
oligo(dT) primer 

[5 * -TGTTACCAATCTGAAGTGGGAGCGGCCGCACCAATTTTTTTTTTTTTTTTTTTT 
TTTTT-3*]/ double-stranded cDNA was size selected, ligated 
to Eco RI adapters (Pharmacia), digested with Not I and 
cloned into the Not I and Eco RI sites of a modified pT7T3 
vector (Pharmacia) . Library went through one round of 
normalization to a Cot = 5. Library constructed by Bento 
Soares and M.Fatima Bonaldo. RNA from sporadic parathyroid 
adenomas was kindly provided by Dr. Stephen Marx, National 
Institute of Diabetes and Digestive and Kidney Diseases, 
NIH. " 

ORIGIN 



Query Match 20.4%; Score 182.6; DB 13; Length 362; 

Best Local Similarity 85.3%; Pred. No. 5.3e-27; 



Matches 203; Conservative 0; Mismatches 35; Indels 0; Gaps 0; 

Qy 656 GCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCG 715 

M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I M 
Db 362 GCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGNCCGGCTTTACGTCAACAGCG 303 

Qy 716 TGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGT 775 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I 
Db 302 TGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGT 243 

Qy 776 CCTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCA 835 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 242 CCTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCA 183 

Qy 836 AGTGTCCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTC 893 

M I I I I III II II I II I I I I M I I 

Db 182 AGGCACATGGGCTGCACTGCTTAGAACTTGGTACCCAGAGCCACCACTTCCCCATCTC 125 

Search completed: August 15, 2004, 09:42:30 
Job time : 2850.01 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



August 15, 2004, 02:48:13 ; Search time 3828.5 Seconds 

(without alignments) 
10155.083 Million cell updates/sec 

US-09-864-675-3 
897 

1 atgaggcgcgacccggcccc caatggtcaacttctcctaa 897 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



3470272 seqs, 21671516995 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



6940544 



Database 



GenEmbl : * 

1 : gb_ba : + 

2 : gb_htg : * 

3 : gb_in : 

4 : gb_om : * 

5 : gb_o V : * 

6: gb_pat:* 

7 : gb_ph : * 

8 : gb_pl : * 

9: gb_pr:* 
10: gb_ro:^ 
11: gb_sts:* 
12: gb_sy:* 
13: gb__un:* 
1 4 : gb_ vi : * 
15: em_ba:^ 
1 6 : em_f un : * 
1 7 : em__hum : * 
18: em_in:^ 
1 9 : em__mu : 
2 0 : em__om : * 
21: em_or:* 
22: em_ov:* 
23: em_pat:* 
24: em_ph:* 
25: em_pl:* 
26: em_ro:* 
27: em sts:* 



28 






29 


em 


vi ; 


30 


em 


ht cf hum " * 


31 


sm 


XX^^ _I_11V ■ 


32 


Bm 


h"t"rf n i" h "T • 


33 


em 


Irh rf Tni i c; • + 

i J. ^ ilLLi. o • 


34 


em 


htfT d1 n ' 


35 


em 


XX O y J_ <i^V_4. ■ 


36 


em 


htg mam:^ 


37 


em 


_htg_vrt : * 


38 


em 


sy : * 


39 


em 


htgo hum:* 


40 


em 


htgo mus : * 


41, 


em 


htgo other : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 
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Sequence 
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Rattus sp. 
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10 
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Rattus sp. 


10 
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Sequence 
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427.8 
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Sequence 


12 


427.8 
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Sequence 
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21 


405.4 


45 


2 


1607 


6 


AR098144 


AR098144 


Sequence 


22 


405.4 


45 


2 


1607 


6 


AR116616 


AR116616 


Sequence 


23 


405.4 


45 


2 


2467 


6 


AR098143 


AR098143 


Sequence 
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D89997 Rattus sp. 



ALIGNMENTS 



RESULT 1 
AB005060 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



AB005060 3020 bp mRNA linear PRI 14-NOV-1997 

Homo sapiens mRNA for NTAK, complete cds . 

A£005060 

AB005060. 1 GI:2626738 
NTAK. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (sites) 

Higashiyama, S . , Horikawa,M. Yamada^K., Ichino^N., Nakano^N., 
Nakagawa,T., Miyagawa^J., Matsushita^N . , Nagatsu,T., Taniguchi,N. 
and Ishiguro^H. 

A novel brain-derived member of the epidermal growth factor family 

that interacts with ErbB3 and ErbB4 

J. Biochem. 122 (3), 675-680 (1997) 

98006324 

9348101 

2 (bases 1 to 3020) 
Ishiguro, H . 

Direct Submission 

Submitted ( 24- JUN-1997 ) Hiroshi Ishiguro, Fujita Health University, 

ICMS; 1-98, kutsukake-cho, Toyoake, Aichi 470-11, Japan 

(E-mail: hishi@fujita-hu.ac.jp, Tel : 0562-93-9393, Fax : 0562-93-8 831 ) 

Location/ Qualifiers 

1. .3020 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/cell_line="SK-N-SH" 

/ eel l_type= "neuroblastoma " 

226. .2778 

/codon_start=l 

/product="NTAK" 

/protein_id="Bi\A23417. 1" 

/db_xref="GI: 2626739" 

/translation="MRQVCCSALPPPPLEKGRCSSYSDSSSSSSERSSSSSSSSSESG 
SSSRSSSNNSSISRPAAPPEPRPQQQPQPRSPAARRAAARSRAAAAGGMRRDPAPGFS 



MLLFGVSLACYSPSLKSVQDQAYKAPVWEGKVQGLVPAGGSSSNSTREPPASGRVAL 
VKVLDKWPLRSGGLQREQVISVGSCVPLERNQRYIFFLEPTEQPLVFKTAFAPLDTNG 
KNLKKEVGKILCTDCATRPKLKKMKSQTGQVGEKQSLKCEAT^GNPQPSYRWFKDGKE 
LNRSRDIRIKYGNGRKNSRLQFNKVKVEDAGEYVCEAENILGKDTVRGRLYVNSVSTT 
LSSWSGHARKCNETAKSYCVNGGVCYYIEGINQLSCKCPNGFFGQRCLEKLPLRLYMP 
DPKQKAEELYQKRVLTITGICVALLWGIVCWAYCKTKKQRKQMHNHLRQNMCPAHQ 
NRSLANGPSHPRLDPEEIQMADYISKNVPATDHVIRRETETTFSGSHSCSPSHHCSTA 
TPTSSHRHESHTWSLERSESLTSDSQSGIMLSSVGTSKCNSPACVEARARRAAAYNLE 
ERRRATAPPYHDSVDSLRDSPHSERYVSALTTPARLSPVDFHYSLATQVPTFEITSPN 
SAHAVSLPPAAPISYRLAEQQPLLRHPAPPGPGPGPGPGPGPGADMQRSYDSYYYPAA 
GPGPRRGTCALGGSLGSLPASPFRIPEDDEYETTQECAPPPPPRPRARGASRRTSAGP 
RRWRRSRLNGLAAQRARAARDSLSLSSGSGGGSASASDDDADDADGALAAESTPFLGL 
RGAHDALRSDSPPLCP7W)SRTYYSLDSHSTRASSRHSRGPPPRAKQDSAPL" 
polyA_site 3020 

/note="39 A nucleotides" 

ORIGIN 

Query Match 94.6%; Score 849; DB 9; Length 3020; 

Best Local Similarity 98,3%; Pred. No. 5.5e-152; 

Matches 858; Conservative 0; Mismatches 15; Indels 0; Gaps 0; 
Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I M I M I I I I I I I I I I M I I I I I I I M I M I I I I I I I I I I I I I M I I I I I I I I 

Db 502 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 561 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I 
Db 562 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 621 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I 
Db 622 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 681 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 682 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 741 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I M I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 742 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGTW^GGAACCAG 801 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I M I I I I I I I 
Db 8 02 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 8 61 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCT^GAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 862 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 921 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I 
Db 922 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 981 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 

I M M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I M I I I I I 

Db 982 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 1041 



Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAT^TATGGCAACGGCAGAAAG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1042 GATGGCT^lAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 1101 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1102 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 1161 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I 

Db 1162 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 1221 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 78 0 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I { M I 

Db 1222 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 12 81 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 84 0 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 1282 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 1341 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II 1 1 1 1 I II I I II I II III 

Db 1342 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 1374 



RESULT 2 
AK124504 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



AK124504 2613 bp mRNA linear PRI 09-SEP-2003 

Homo sapiens cDNA FLJ42513 fis, clone BRACE204 6295, highly similar 
to NTAK PROTEIN. 
AK124504 

AK124504. 1 GI: 34530302 

oligo capping; fis (full insert sequence) . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Kawakami^B., Sugiyama^A. , Takemoto,M,, Sugiyama,T., Irie,R., 
Otsuki^T., Sato^H., Wakamatsu, A. , Ishii,S., Yamamoto,J., Isono,Y., 
Kawai-Hio, Y. , Saito^K., Nishikawa, T . , Kimura,K., Yamashita, H . , 
Matsuo^K., Nakamura,Y., Sekine^M., Kikuchi,H,, Kanda,K,, 
Wagatsuma,M, , Murakawa^K,, Kanehori^K., Takahashi-Fuj ii , A. , 
Oshima,A., Suzuki, Y., Sugano,S., Nagahari,K., Masuho,Y., Nagai,K. 
and Isogai,T. 

NEDO human cDNA sequencing project 

Unpublished 

2 (bases 1 to 2613) 

Isogai,T. and Yamamoto,J, 

Direct Submission 

Submitted (15-JUL-2003) Takao Isogai, FLJ Project (HRI Team); 2-6-7 
Kazusa-Kamatari, Kisarazu, Chiba 292-0818, Japan 

(E-mail : genomics @hri. CO. jp, Tel : 8 1-438-52-3975, Fax: 81-438-52-3986) 
NEDO human cDNA sequencing project supported by Ministry of 
Economy, Trade and Industry of Japan; cDNA full insert sequencing: 
Research Association for Biotechnology (RAB) ; cDNA library 
construction: Helix Research Institute (HRI) (supported by Japan 



Key Technology Center etc.); 5'- & 3* -end one pass sequencing: RAB, 
HRI, and Biotechnology Center, National Institute of Technology and 
Evaluation; clone selection for full insert sequencing: HRI and 
RAB; annotation: HRI and RAB. 
FEATURES Location/Qualifiers 
source 1. ,2613 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/ cl one= " BRACE2 04 62 95" 

/ tissue_type="cerebelluin" 

/clone_lib="BRACE2" 

/note="cloning vector: pME18SFL3" 

ORIGIN 

Query Match 94.5%; Score 847.4; DB 9; Length 2613; 

Best Local Similarity 98.2%; Pred. No. l.le-151; 

Matches 857; Conservative 0; Mismatches 16; Indels 0; Gaps 0; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I 
Db 436 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTTGCTCGCCTGC 495 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 4 96 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 555 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 

. I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 556 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 615 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 616 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 675 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

M I I M I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 676 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 735 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I 
Db 736 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 795 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

I I I I I I I I I I I I I M I I I I I M I M I M I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I 

Db 7 96 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 855 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
Db 8 56 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 915 

Qy 4 81 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 916 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 975 



Qy 



541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 



Db 



97 6 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 1035 



Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1036 7\ACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 1095 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 72 0 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I M M I I I I I I I I I I I I I I I I I I I I M I I 

Db 1096 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 1155 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 7 80 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1156 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 1215 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 84 0 

I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I III 

Db 1216 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 1275 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II Mill III MINI III 

Db 1276 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 1308 



RESULT 3 
AR098145 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AR098145 1884 bp DNA linear 

Sequence 5 from patent US 6074841, 

AR098145 

AR098145.1 GI: 12 807402 

Unknown . 

Unknown . 

Unclassified. 

1 (bases 1 to 1884) 

Gearing, D. P. and Bus field, S . J. 

Don-1 gene and polypeptides and uses therefor 
Patent: US 6074841-A 5 13-JUN-2000; 

Location/Qualifiers 

1. .1884 

/ o r gani sm= " unknown " 
/mol_type="unas signed DNA" 



PAT 14-FEB-2001 



Query Match 93. 1%; 

Best Local Similarity 98.1%; 
Matches 856; Conservative 



Score 835.4; DB 6; 
Pred. No. 2.2e-149; 
0; Mismatches 16; 



Length 1884; 
Indels 1; 



Gaps 



1; 



Qy 

Db 



218 



ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I II I M II I I II I I I I I I II II I M I I I M I I II I I I II I I I I II I II I M I M M I M I 

ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 277 



Qy 



Db 



61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

I I I I I I I I II I I I II I I I I I I I I I I I I I I I I II I M II I I I I I I M I I I I I II I I I I I I I 
27 8 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 337 



Qy 



121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 

I I I I I M I I M I I I I I M I I I I I I I I I II I I I I I I I M I I I I II I II I I I I I I M I I I I I 



180 



Db 



338 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 397 



Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I I I I I I I I I I I I I M I I M I I I M I M I I I I I M I I I I I I I I I I I I I M I I I I I I I I 

Db 398 CCCGCCTCGGGTCGGGTGGCGTTGGTT^GGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 457 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 458 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 517 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I.I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
Db 518 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 577 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

MM! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I 
Db 578 CCCCT-GATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGGC 63 6 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 48 0 

II I I I I I I II I I I I M M I M II I M I I M II I I I II I II M I II I I M I I I II I I I I I I 

Db 637 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 696 

Qy 4 81 CT^TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 697 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 756 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 757 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 816 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I II I I I I II I I I I I I I I M I I I I I I I I I I I M I I II I I II I I I II I I I I I I II I I I I I 

Db 817 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 876 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

II I I I I II I I I I I I I II I I I II I I I I I II I I I M I I I M I I I I I I II I II I III I I I I I I 

Db 877 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 936 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

I I I I I I I I I I I I II I II I I I M I I I I I I I I I I I I I I I II I I I I I I M I I I II I M I I I I I 
Db 937 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 996 

Qy 7 81 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 84 0 

I I I I I M I I I I I I I M I I I M I I I I I I I I I I I M I I I II I II I I I II I II II I II I III 
Db 997 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 1056 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

M I I I I I III I I I I I I III 

Db 1057 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 1089 



RESULT 4 
AR116617 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 



AR116617 
Sequence 
AR116617 
AR116617. 



1884 bp DNA 
5 from patent US 6133423. 

1 GI:14096939 



linear PAT 16-MAY-2001 



KEYWORDS 

SOURCE Unknown . 

ORGANISM Unknown. 

Unclassified. 
REFERENCE 1 (bases 1 to 1884) 

AUTHORS Gearing, D. P. and Busf ield, S . J. 

TITLE Don-1 gene and polypeptides and uses therefor 

JOURNAL Patent: US 6133423-A 5 17-OCT-2000; 
FEATURES Location/Qualifiers 
source 1. .1884 

/ organism^ "unknown" 
/mol_type="unassigned DNA" 

ORIGIN 

Query Match 93,1%; Score 835.4; DB 6; Length 1884; 

Best Local Similarity 98.1%; Pred. No. 2.2e-149; 

Matches 856; Conservative 0; Mismatches 16; Indels 1; Gaps 1; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 218 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 277 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I M 
Db 278 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 337 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I 
Db 338 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 397 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 240 

I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I 
Db 398 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 457 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I M I I 
Db 458 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 517 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M M I I I I I M I I I I I I I I I 
Db 518 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 577 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGC7UVGATCCTGTGCACTGAC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I 
Db 578 CCCCT-GATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGGC 636 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

I I I I I I I I M I I I I I I I I I I M I I M I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I 
Db 637 TGCGCCACCCGGCCCAAGTTGAAGAAGATG7UVGAGCCAGACGGGACAGGTGGGTGAGAAG 696 

Qy 4 81 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I IN I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 697 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 756 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 757 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 816 



Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I M I I I I I I 
Db 817 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 876 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 877 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 936 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I 
Db 937 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 996 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I III 

Db 997 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 1056 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II I I I I I III I I I I I I III 

Db 1057 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 108 9 
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ORIGIN 



AR072052 3441 bp DNA 

Sequence 1 from patent US 5912326. 
AR072052 

AR072 052.1 GI: 7222 94 0 

Unknown. 

Unknown . 

Unclassified. 

1 (bases 1 to 3441) 

Chang, H. 

Cerebellum-derived growth factors 
Patent: US 5912326-A 1 15-JUN-1999; 

Location/Qualifiers 

1. .3441 

/organism= "unknown" 
/mol_type-"unas signed DNA" 



linear PAT 18-FEB-2000 



Query Match 87.4%; 
Best Local Similarity 92.2%; 
Matches 826; Conservative 



Score 784; DB 
Pred. No. 1.4e- 
0; Mismatches 



6; Length 3441; 
139; 

70; Indels 0; 



Gaps 



0; 



Qy 



Db 



1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 
I II I I I I I I I I II I I I I I II I I M I I I I I M I I I II I I I I II M II I I I I I I I I M I I 
18 0 ATGAGGCGCGACCCGGCCCCCGGCTTCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 239 



Qy 

Db 

Qy 

Db 



61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

II II I I I I II I I I I I I I I II M I I I I I I I I I I I I I I I II I I II I I I I I I I M I I M I I I 
240 TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 299 

121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCT^ACAGCACCCGAGAGCCG 18 0 

I I I M I II I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
300 GGCAAGGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCT 359 



Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 240 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I 
Db 360 CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 419 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I M I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I 
Db 42 0 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCGAAAGGAACCAG 47 9 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I M I II I I I I I I I 

Db 4 80 CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 539 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I II I I I I I I 

Db 540 CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 599 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGT^AG 480 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I Ml I M I I I I MINI 

Db 600 TGCGCAACCCGGCCCAAGCTGAAGTVAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGAAG 659 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 

II Mill II II II I II I I II II II II I II I I I I I Mill II II M M M I 

Db 660 CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 719 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

II I M I I I II I I I I I I M I M II M M II I I I M I II M II I I II I M M II II I 

Db 720 GACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGCATCAAGTATGGCAACGGCAGAAAG 779 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I II I I I M I I I I I I I M I I I I I II I II I M I I I I II II I I II I I I Mill Ml 
Db 7 80 AACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 839 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

M M II M I I I I I M I I I I I I I M II I I M I I II M I I II I I M I II I I I I 

Db 840 GCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 899 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

I I II I I I I II I I I I I M I M M M I M I M I I M I I I I I M II II I II M I I M M 

Db 900 ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 959 

Qy 7 81 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

M M I I M I I I II M M I I I I I II I I I I I I I I M I I I I II I II I I M II I M I II 

Db 960 TGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGCATCT^CCAACTCTCCTGCAAGTGT 1019 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCTA 896 

I I II I I I I I II I I I I M M I I I II I M I I I I I I II M I I I I II I I M M I II I I I 

Db 1020 CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCAA 1075 
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AR072053 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 



AR072053 
Sequence 3 
AR072053 
AR072053. 1 



993 bp DNA 
from patent US 5912326. 

01:7222941 



linear PAT 18-FEB-2000 



SOURCE Unknown. 

ORGAN I SM Un known . 

Unclassified. 
REFERENCE 1 (bases 1 to 993) 

AUTHORS Chang, H. 

TITLE Cerebellum-derived growth factors 

J0URN7U. Patent: US 5912326-A 3 15-JUN-1999; 
FEATURES Location/Qualifiers 
source 1 . . 993 

/organism="unknown" 
/mol type="unassigned DNA" 
ORIGIN ~ 

Query Match 82.3%; Score 738.6; DB 6; Length 993; 

Best Local Similarity 90.4%; Pred, No. 6.8e-131; 

Matches 789; Conservative 0; Mismatches 84; Indels 0; Gaps 0; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 60 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 61 TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

I I I I I I I I M I I I I I I I I III II I I I I I I I I I I I II I I M I I I I I I I I I I I I I 
Db 121 GGCT^GGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCT 180 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I II I 
Db 181 CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I I I I I I I II I I M I I II I I I I I M II I I I I I I II I I I I I I I I I I I I I I I I I I I II 
Db 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCG7WVGGAACCAG 300 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I II I I I I I I I I I M I II I I I I I II I I I I I I I I I I I I I I I II I I I M II I II I 

Db 301 CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 360 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

II MM I I I II M I I II I II II II I II I II I I I I I II I M II I I I I II I I I I I 

Db 361 CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

Mill II I I I I I I I I II I I I I I II I I I M I II I Mill III I II I I II II I II I 
Db 421 TGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGAAG 48 0 

Qy 4 81 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

II Mill II I II I M II I II II II II II M M I I II I I I M I I II I I I I I 
Db 481 CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 54 0 



Qy 

Db 



541 
541 



600 



600 



Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 601 AACTCACGGCTACAGTTCAACATy^GTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 660 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 72 0 

II I II I M I I I I I I M II I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I 

Db 661 GCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 720 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGC7VACGAGACAGCCAAGTCCTAT 780 

I I I M I II I I I I I I M I I I I I M I I I M I I I I II I I I M I I M I I II M I I I I I I I 

Db 721 ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 780 

Qy 7 81 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 84 0 

II II I I I I M I I I I I I I I I I I I I I I II I I I II I I I II II I II I I I I M I II III 

Db 781 TGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGCATCAACCAACTCTCCTGCAAATGT 840 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II I I I I I III I I I I I I III 

Db 841 CCAAACGGATTCTTCGGACAGAGATGTTTGGAG 873 
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2947 bp mRNA linear 
rtiRNA for NTAK alphal, complete cds . 



Chordata; 
Rodentia ; 



Craniata; Vertebrata; Euteleostomi; 
Sciurognathi; Muridae; Murinae; 



CDS 



D89995 2947 bp mRNA linear ROD 07-FEB-1999 

Rattus sp. 
D89995 

D89995.1 GI:2605629 
NTAK alphal. 
Rattus sp. 
Rattus sp. 
Eukaryota ; Metazoa; 
Mammalia; Eutheria; 
Rattus . 

1 (sites) 

Higashiyama, S . , Horikawa,M., Yamada,K., Ichino,N., Nakano^N., 
Nakagawa,T., Miyagawa,J,, Matsushita^ N . , Nagatsu,T., Taniguchi^N. 
and Ishiguro,H. 

A novel brain-derived member of the epidermal growth factor family 

that interacts with ErbB3 and ErbB4 

J. Biochem. 122 (3), 675-680 (1997) 

98006324 

9348101 

2 (bases 1 to 2947) 
Ishiguro, H. 

Direct Submission 

Submitted ( 21-DEC-1996) Hiroshi Ishiguro, Fujita Health University, 
ICMS; 1-98, kutsukake-cho, Toyoake, Aichi 470-11, Japan 
(E-mail:hishi@fujita-hu. ac.jp, Tel : 0562-93-9393 , Fax: 0562-93-8 831) 
Sequence updated ( 2 8-Feb-1997 ) by:Hiroshi Ishiguro. 

Location/Qualifiers 

1. .2947 

/organism="Rattus sp . " 
/mol_type="mRNA" 
/db_xref="taxon: 10118" 
/cell_line="PC12" 
/cell_type="pheochromocytoma" 
79. .2685 



/ codon_start=l 
/product="NTAK alphal" 
/protein_id="BAA23344. 1" 
/db_xref="GI: 2605630" 

/translation="MRQVCCSALPPPLEKARCSSYSYSDSSSSSSSNNSSSSTSSRSS 
SRSSSRSSRGSTTTTSSSENSGSNSGSIFRPAAPPEPRPQPQPQPRSPAARRAAARSR 
AAAAGGMRRDPAPGSSMLLFGVSLACYSPSLKSVQDQAYKAPVWEGKVQGLAPAGGS 
SSNSTREPPASGRVALVKVLDKWPLRSGGLQREQVISVGSCAPLERNQRYIFFLEPTE 
QPLVFKTAFAPVDPNGKNIKKEVGKILCTDCATRPKLKKMKSQTGEVGEKQSLKCEAA 
AGNPQPSYRWFKDGKELNRSRDIRIKYGNGRKNSRLQFNKVKVEDAGEYVCEAENILG 
KDTVRGRLHVNSVSTTLSSWSGHARKCNETAKSYCVNGGVCYYIEGINQLSCKCPNGF 
FGQRCLEKLPLRLYMPDPKQKHLGFELKEAEELYQKRVLTITGICVALLWGIVCWA 
YCKTKKQRRQMHHHLRQNMCPAHQNRSIANGPSHPRLDPEEIQMADYISKNVPATDHV 
IRREAETTFSGSHSCSPSHHCSTATPTSSHRHESHTWSLERSESLTSDSQSGIMLSSV 
GTSKCNSPACVEARARRAAAYSQEERRRAAMPPYHDSIDSLRDSPHSERYVSALTTPA 
RLSPVDFHYSLATQVPTFEITSPNSAHAVSLPPAAPISYRLAEQQPLLRHPAPPGPGP 
GPGADMQRSYDSYYYPAAGPGPRRGACALGGSLGSLPASPFRIPEDDEYETTQECAPP 
PPPRPRTRGASRRTSAGPRRWRRSRLNGLAAQRARAARDSLSLSSGSGCGSASASDDD 
ADDADGALAAESTPFLGLRAAHDALRSDSPPLCPAADSRTYYSLDSHSTRASSRHSRG 
PPTRAKQDSGPL" 

ORIGIN 

Query Match 82.2%; Score 737; DB 10; Length 2947; 

Best Local Similarity 90.3%; Pred. No. 1.3e-130; 

Matches 78 8; Conservative 0; Mismatches 85; Indels 0; Gaps 0; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I M I II I I I M M I I I I II I I I I I I 
Db 403 ATGAGGCGCGACCCGGCCCCCGGCTCCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 4 62 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I M I I I I I I M I I I I I I I I I M I I I I I I I I M I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 463 TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 522 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 

I I M I I I I M I I I I MM III II II II I II I II M I II I I II II II II I I I I I 

Db 523 GGCT^GGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCT 582 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 240 

I I II II I II II II I I I I M II I I I I II II I M I I I II I II M I I II I I I II I I I II II 

Db 583 CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 642 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

II II I II I M I II II I I II M I II I I M II M II I I I I I I II II II I M II I I I I I I I 

Db 643 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCGAAAGGAACCAG 702 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 3 60 

I II I M M M I II II II II II I II I I M M II I I I II M II II II M M II I I I I I 

Db 703 CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 7 62 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

II MM I I II I II I II I I M M M I II II II II M II II M I II M II I I I I I 

Db 763 CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 822 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

Mill I II II II II I M I I I I I I M M II I M I I II II III M I M II I I I I I I 

Db 823 TGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGAAG 882 



Qy 4 81 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

II I II I I I II I I II I I I I II II II II I I I M I I I I I I II II I I I I II I I I 
Db 8 83 CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 942 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

II I I I I I I I I I I II I II M II II I I I I I I I I I I I I I I I I I I I I I I I M I M II I I 
Db 943 GACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGCATCAAGTATGGCAACGGCAGAAAG 1002 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I M I I I I I I I I I I I II I I I Ml 

Db 1003 AACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 1062 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

II I I I I II I II II I II I I I I I I M II I I I I I I I I I I I I I I I I I I I I II I II 

Db 1063 GCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 1122 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 78 0 

Mill I I I I I M M I II II I II II II II II I I I II I I M II I II I I I M M M I II 

Db 1123 ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 1182 

Qy 7 81 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

II II II II II I I I I I I I I I II II II I I M I I I I I II I I I I I M M I I I I I I III 
Db 1183 TGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGCATCT^CCAACTCTCCTGCAAATGT 1242 

Qy 8 41 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II Mill III I M I I I III 

Db 1243 CCAAACGGATTCTTCGGACAGAGATGTTTGGAG 1275 



RESULT 8 

E16456 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



Craniata; Vertebrata; Euteleos tomi ; 
Sciurognathi; Muridae; Murinae; 



E16456 3076 bp DNA linear PAT 28-JUL-1999 

Rat mRNA for neuregulin-like Transmembrane Activator for ErbB 
Kinases (NTAK) . 
E16456 

E16456.1 GI:5711139 
JP 1998179166-A/l, 
Rattus sp. 
Rattus sp. 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
Rattus . 

1 (bases 1 to 3076) 

Higashiyama, S . , Taniguchi, N . , Ishiguro,K. and Nagatsu^T. 

GENE ENCODING RECEPTOR TYPE TYROSINE-KINASE ERB B LIGAND AND ITS 

Patent: JP 1998179166-A 1 07-JUL-1998; 

HIGASHIYAMA SHIGEKI 

OS Rattus sp. (rat) 

PN JP 1998179166-A/l 

PD 07-JUL-1998 

PF 25-DEC-1996 JP 1996356998 

PI HIGASHIYAMA SHIGEKI, TANIGUCHI NAOYUKI , ISHIGURO KEIJI, PI 
NAGATSU TOSHIHARU 

PC C12N15/09,C07K14/705,C07K16/28,C12N5/10,C12N15/02,C12P21/02, 
PC C12P21/08, 

PC C12Ql/68,G01N33/53,G01N33/566//A61K31/70,A61K38/46,A61K39/395, 



PC A61K48/00, 

PC C07H21/04, (C12N5/10,C12R1; 91) , (C12P21/02 , C12R1 : 91 ) ; CC 
strandedness : Double; 
CC topology: Linear; 

FH Key Location/Qualifiers 
FH 

FT source 1. .3076 

FT /organism= ' Rattus sp . * 

FT /cell_line= ' PC12 ' 

FT CDS 232. ,2814 

FT /product=*NTAK protein*. 

FEATURES Location/Qualifiers 
source 1. .3076 

/organism="Rattus sp." 

/mol_type=" genomic DNA" 

/db_xref="taxon: 10118" 

ORIGIN 

Query Match 82.2%; Score 737; DB 6; Length 3076; 

Best Local Similarity 90.3%; Pred. No. l,3e-130; 

Matches 78 8; Conservative 0; Mismatches 85; Indels 0; Gaps 0; 

ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I M M I I I M I I I I I I I I I I I I I I III I I I I II I I I I I M I I II I I I I I I I I I M I 

ATGAGGCGCGACCCGGCCCCCGGCTCCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 615 

TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 
I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I II I I I I I I I II M I I I II II I I I 
TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 675 

GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 
I I I I I I I I I I I II I I I I I III II II I I I I I II I II I I I I I I I I I I I I I I I I I I 
GGCAAGGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCT 735 

CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I I I I I I M II I I I I I I I I MM I I II I M II I I I I I M II M I I II II I M I II I 

CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 795 
GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

M I II I I II M II M I M I II II II I I M I II I II II I I II I II II II I II M I I II I 

GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCGAAAGGAACCAG 855 

CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I M I II I I II II I I II I I II II I II II I II II I I II I M I II II M I II M II 
CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 915 

CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

II MM I I I I I I I I II II I II I I M II I I I I I I I I I I II I II II M II I I I M 
CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 975 

TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

Mill II I M M II I I I I II II I I M I I M I I I II II I III I M I I M I I I II I 

TGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGAAG 1035 
CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

M I I II I M I I I II I I M II II II II II I I M M II I II II I I II II II I 

CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 1095 



Qy 


1 


Db 


556 


Qy 


61 


Db 


616 


Qy 


121 


Db 


676 


Qy 


181 


Db 


736 


Qy 


241 


Db 


796 


Qy 


301 


Db 


856 


Qy 


361 


Db 


916 


Qy 


421 


Db 


976 


Qy 


481 


Db 


1036 



Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

M I I I I M I I I I I I I I M I II M II II I I I I I I II II I I I I I I I I II I II I I M I 
Db 1096 GACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGCATCAAGTATGGCAACGGCAGAAAG 1155 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I M M I I I II I II I I I M I I I I II M I I I I I I I I I I I I M I I Mill Mill III 

Db 1156 AACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 1215 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 72 0 

II M I II I I I M I I I I I I II I II I II II II II I II I I M I II I M MINI 

Db 1216 GCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 1275 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGG7VAGTGCAACGAGACAGCCAAGTCCTAT 78 0 

I I I I I I I I I I I M II II I I M II II I II I M I II II I M I II M I I II II II II I I 

Db 1276 ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 1335 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

II M II II II II I II II M II II M II I I I II II I II II I II I II I II I II III 

Db 1336 TGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGCATC7\ACCAACTCTCCTGCAAATGT 1395 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II II I I I III I I I M I III 

Db 1396 CCAAACGGATTCTTCGGACAGAGATGTTTGGAG 142 8 



RESULT 9 

D89996 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



3077 bp mRNA linear 
mRNA for NTAK alpha2, complete cds . 



ROD 07-FEB-1999 



Chordata; 
Rodentia; 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi; Muridae; Murinae; 



Ichino,N. , Nakano^N. , 
Nagatsu,T., Taniguchi,N. 



D89996 
Rattus sp. 
D89996 

D89996. 1 GI :2605631 
NTAK alpha2, 
Rattus sp, 
Rattus sp . 
Eukaryota; Metazoa; 
Mammalia ; Eutheria ; 
Rattus . 

1 (sites) 

Higashiyama,S. , Horikawa^M. , Yamada,K. , 
Nakagawa,T., Miyagawa,J., Matsushita, N . 
and Ishiguro,H. 

A novel brain-derived member of the epidermal growth factor family 

that interacts with ErbB3 and ErbB4 

J. Biochem. 122 (3), 675-680 (1997) 

98006324 

9348101 

2 (bases 1 to 3077) 
Ishiguro, H . 

Direct Submission 

Submitted (21-DEC-1996) Hiroshi Ishiguro, Fujita Health University, 

ICMS; 1-98, kutsukake-cho, Toyoake, Aichi 470-11, Japan 

( E-mail :hishi@fujita-hu.ac,jPr Tel : 0562-93-9393, Fax : 0562-93-8 831 ) 

Location/Qualifiers 

1. .3077 

/organism="Rattus sp . " 
/mol_type="mRNA" 



/db_xref="taxon: 10118" 
/cell_line-"PC12" 
/cell_type="pheochromocytoma " 
CDS 233, .2815 

/codon_start=l 
/product="NTAK alpha2" 
/protein_id="BAA23345. 1" 
/db__xref="GI : 2605632" 

/translation="MRQVCCSALPPPLEKARCSSYSYSDSSSSSSSNNSSSSTSSRSS 
SRSSSRSSRGSTTTTSSSENSGSNSGSIFRPAAPPEPRPQPQPQPRSPAARRAAARSR 
AAAAGGMRRDPAPGSSMLLFGVSLACYSPSLKSVQDQAYKAPVWEGKVQGLAPAGGS 
SSNSTREPPASGRVALVKVLDKWPLRSGGLQREQVISVGSCAPLERNQRYIFFLEPTE 
QPLVFKTAFAPVDPNGKNIKKEVGKILCTDCATRPKLKKMKSQTGEVGEKQSLKCEAA 
AGNPQPSYRWFKDGKELNRSRDIRIKYGNGRKNSRLQFNKVKVEDAGEYVCEAENILG 
KDTVRGRLHVNSVSTTLSSWSGHARKCNETAKSYCVNGGVCYYIEGINQLSCKCPNGF 
FGQRCLEKLPLRLYMPDPKQKAEELYQKRVLTITGICVi^LLWGIVCWAYCKTKKQR 
RQMHHHLRQNMCPAHQNRSLANGPSHPRLDPEEIQMADYISKNVPATDHVIRREAETT 
FSGSHSCSPSHHCSTATPTSSHRHESHTWSLERSESLTSDSQSGIMLSSVGTSKCNSP 
ACVEARARR7W^.YSQEERRRAAMPPYHDSIDSLRDSPHSERYVSALTTPARLSPVDFH 
YSLATQVPTFEITSPNSAHAVSLPPAAPISYRLAEQQPLLRHPAPPGPGPGPGADMQR 
SYDSYYYPAAGPGPRRGACALGGSLGSLPASPFRIPEDDEYETTQECAPPPPPRPRTR 
GASRRTSAGPRRWRRSRLNGLAAQRARAARDSLSLSSGSGCGSASASDDDADDADGAL 
AAESTPFLGLRAAHDALRSDSPPLCPAADSRTYYSLDSHSTRASSRHSRGPPTRAKQD 
SGPL" 

ORIGIN 

Query Match 82.2%; Score 737; DB 10; Length 3077; 

Best Local Similarity 90.3%; Pred. No. 1.3e-130; 

Matches 788; Conservative 0; Mismatches 85; Indels 0; Gaps 0; 

1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 
I I M I I I I I I I I M I I I I I I I I I I I III I I I I I I I II I I I I I II I I II I I II I I I I I 
557 ATGAGGCGCGACCCGGCCCCCGGCTCCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 616 

61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

INN II I I I II I II I I II I I I I I I M I I I M M I I I I I I I I I II M II M I II 

617 TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 676 
121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

M I I I I I I I M M I II I I III II Mill I II I I I I I M I I M I I I I I I M I I I 

677 GGC7UVGGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCT 736 
181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I M M I I I I I I I I I I I II I II II I I I I M I I I II I M I I I I I I I II I I I M I I I I I I I 

737 CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 7 96 
241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 3 00 

I M I I M I I M M I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I M I I I 

797 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCGAAAGGT^CCAG 856 
301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I M I I I M I I I I I M I I I I I I M I M II I I I I M II I I I I I I M I I I II M 

8 57 CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 916 

3 61 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

II I I I I I M I I I I I I I I I M I I I I I I I I II I II I I I I I I I M I II I I I I I I I I 
917 CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 976 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 48 0 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I Mill III I I II I I I I II I I I 
977 TGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGT^G 1036 

481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 

N I I I I I I I I I M I I I I I II II II II I I M M I I Mill II 11 I I I M I I 

Db 1037 CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 1096 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

N M I II I I I I M II I II I II II I I I I I I II II II M M II I II M II II I II II 

Db 1097 GACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGCATCAAGTATGGCAACGGCAGAAAG 1156 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I M I II II II II M II II II II II II I II M I I II II I M I I II I II II I III 

Db 1157 AACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 1216 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 72 0 

II I II I I M I I I I M II II II I II II II II II I I II I II M I II I II M II 

Db 1217 GCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 1276 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 78 0 

I I I I I I I II I I I I M I I I I I II II II I I I I I I M I II M II M I II I I I I II II II 

Db 1277 ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 133 6 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

II II M I I II I I I I I II M I II I I I I I I I I I II M M I I I II II I I I I II I III 

Db 1337 TGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGCATCAACCAACTCTCCTGCAAATGT 1396 

Qy 8 41 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 8 73 

II I II I I I I I I II II I I M 

Db 13 97 CCAAACGGATTCTTCGGACAGAGATGTTTGGAG 1429 

RESULT 10 
AR098146 

LOCUS AR098146 1476 bp DNA linear PAT 14-FEB-2001 

DEFINITION Sequence 7 from patent US 6074841, 

ACCESSION AR098146 

VERSION AR098146.1 GI: 12807403 
KEYWORDS 

SOURCE Unknown. 

ORGANISM Unknown, 

Unclassified . 

REFERENCE 1 (bases 1 to 1476) 

AUTHORS Gearing, D. P. and Busf ield, S . J . 

TITLE Don-1 gene and polypeptides and uses therefor 

JOURNAL Patent: US 6074841-A 7 13-JUN-2000; 
FEATURES Location/Qualifiers 



source 1. .1476 

/ organism="unknown" 
/mol_type="unassigned DNA" 

ORIGIN 



Query Match 47.7%; Score 427.8; DB 6; 

Best Local Similarity 89.8%; Pred. No, 1.3e-71; 
Matches 459; Conservative 0; Mismatches 52; 



Length 1476; 
Indels 0; Gaps 



0; 



Qy 363 CCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 422 

M I I I I I III I II I II I I I I I I II 

98 CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 157 

Qy 423 CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 482 

I I I N I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I II I I I I I I I I I I I I I II I 
158 AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 217 

Qy 483 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 542 

N M M I I I I I I I I I I I I I M I M I I M I M M M I I I I I I I I I I I I I II I I I I II I M I 

218 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 27 7 
Qy 543 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 602 

I I N N I I I I I I I I I I I I I M I I M I I II I I I I I I I I I I I I I I I I I I II I I I M I II I I I 

278 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 337 
Qy 603 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 662 

M I I I I I I I I I I I I I I I I I M I II I I I I I I M I M I I I I I I I I I I I I M I II II M I I I I 

Db 33 8 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 397 

Qy 663 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 722 

I I I I I I M I I M I I I I M I I I I M I I I I I M I I I I I I I I I I I I I I I I M I I I M I M I I I 

Db 398 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 457 

Qy 723 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 7 82 

I I I M I I I M II I I I I I M I I I II I I M II I I I I I I M I I I I I I I I I I I I I I I M II I M 

Db 458 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 517 

Qy 7 83 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 842 

I I I M I I M I I I I M I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M II I M 

Db 518 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 577 

Qy 843 TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

Mill Ml I I I I M Ml 
Db 578 AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 



RESULT 11 

AR116618 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AR116618 1476 bp DNA linear 

Sequence 7 from patent US 6133423. 

AR116618 

AR116618 .1 GI : 1409694 0 

Unknown . 

Unknown . 

Unclassified, 

1 (bases 1 to 1476) 

Gearing, D. P. and Busf ield, S . J . 

Don-1 gene and polypeptides and uses therefor 
Patent: US 6133423-A 7 17-OCT-2000; 

Location/Qualifiers 

1. .1476 

/organism= "unknown" 
/mol_type="unassigned DNA" 



PAT 16-MAY-2001 



Query Match 47.7%; Score 427.8; DB 6; Length 1476; 

Best Local Similarity 89.8%; Pred. No. 1.3e-71; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0; 



Qy 


363 


CCTCGATACC7\ACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 
1 1 1 1 1 M III 1 II 1 1 1 1 1 1 1 1 1 II 
CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 


422 


Db 


98 


157 


Qy 


423 


CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 

1 1 1 1 M 1 1 M M M M 1 1 1 M 1 M 1 1 II II 1 1 II 1 II 1 1 II 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 

AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 


482 


Db 


158 


217 


Qy 


483 


ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 
N 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 


542 


Db 


218 


277 


Qy 


543 


TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 

1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 

TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 


602 


Db 


278 


337 


Qy 


603 


CTCACGACTACAGTTCAACT^GGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 

N 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 M 1 1 1 1 1 1 1 1 M 1 II II 1 1 1 

CTCACGACTACAGTTCAACAAGGTGT^GGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 


662 


Db 


338 


397 


Qy 


663 


CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 

1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 M 1 II 1 M 1 1 1 

CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 


722 


Db 


398 


457 


Qy 


723 


CACCCTGTCATCCTGGTCGGGGCACGrrrGnAAr^Tf:;rAArr^Ar:ararrrnar'pnr'Ti7\rpm-- 

1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 


^7 0 0 


Db 


458 


517 


Qy 


783 


CGTCT^TGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 

1 N 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 II M 1 1 1 M 1 1 1 1 1 II 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 

CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 


842 


Db 


518 


577 


Qy 


843 


TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

1 II 1 1 III 1 11 II 1 III 
AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 




Db 


578 





RESULT 12 

AR098155 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 



AR098155 2268 bp DNA 

Sequence 31 from patent US 6074841. 
AR098155 

AR098155. 1 GI: 12807412 



Unknown. 

Unknown . 

Unclassified. 

1 (bases 1 to 2268) 

Gearing, D. P. and Busf ield, S . J. 

Don-1 gene and polypeptides and uses therefor 
Patent: US 6074841-A 31 13-JUN-2000; 
Location/Qualifiers 



linear PAT 14-FEB-2001 



ORIGIN 



source i. .2268 

/organism= "unknown" 
/mol_type="unassigned DNA" 



Query Match 47.7%; Score 427.8; DB 6; Length 2268; 

Best Local Similarity 89.8%; Pred. No. 1.2e-71; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0 



Qy 


363 


CCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 
M 1 1 1 1 1 III MM II 1 1 1 1 1 1 II 
CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 


422 


Db 


98 


157 


Qy 

Db 


423 
158 


CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 

M II 1 1 1 1 1 1 1 M II 1 II 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 II 1 1 II 1 1 1 1 

AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 


482 
217 


Qy 


483 


ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 542 
1 M 1 1 1 1 1 M M 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II II II 1 1 1 1 II 1 1 1 II 1 1 
ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 277 


Db 


218 


Qy 


543 


TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 602 

1 1 1 1 M 1 1 M M 1 1 1 1 1 1 1 1 M M 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 

TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 337 


Db 


278 


Qy 

Db 


603 
338 


CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 

1 1 1 1 1 M 1 1 1 1 M II 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 M 1 1 II 1 1 1 1 

CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 


662 
397 


Qv 
Db 


663 
398 


CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 
1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 
CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 


722 
457 


Qy 


723 


CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 

1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 M M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 

CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 


782 


Db 


458 


517 


Qy 


783 


CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 

1 1 M M II 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 


842 


Db 


518 


577 


Qy 


843 


TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

1 1 1 1 1 III 1 1 1 1 1 1 III 
AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 




Db 


578 





RESULT 13 

AR116627 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 



AR116627 2268 bp DNA 

Sequence 31 from patent US 6133423. 
AR116627 

AR116627. 1 GI: 14 096949 

Unknown. 

Unknown. 

Unclassified. 

1 (bases 1 to 2268) 



linear PAT 16-MAY-2001 



AUTHORS Gearing, D. P. and Bus field, S . J. 

TITLE Don-1 gene and polypeptides and uses therefor 

JOURNAL Patent: US 6133423-A 31 17-OCT-2000; 
FEATURES Location/Qualifiers 
source 1, .2268 

/organism="unknown" 
/inol_type="unassigned DNA" 

ORIGIN 

Query Match 47.7%; Score 427,8; DB 6; Length 2268; 

Best Local Similarity 89.8%; Fred. No. 1.2e-71; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0; 

Qy 363 CCTCGATACCAACGGCAAAAATCTCAAGAT^GAGGTGGGCAAGATCCTGTGCACTGACTG 422 

I I I I I I I III I I I I I I I I I I I I II 

Db 98 CCGCGGC7\AG7^GCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 157 

Qy 423 CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAG7\AGCA 4 82 

I I M I I I I I I I I II I I M I M I I I M I I I I I I I I I I I I I I II II MM I I I I I I M II I 

Db 158 AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 217 

Qy 4 83 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 542 

M II II M II II M II II M II II II M I II M II II M II II II M II II II I II II M 

Db 218 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 277 

Qy 543 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 602 

I I I I I I I I I I I I I I I M II II II M M II M II II M M II II II II II M II II II I I I 

Db 278 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 337 

Qy 603 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 662 

I I I I I I II I I M II II I II M II II II II II II II II II II II II M M II II I II II M 
Db 338 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 397 

Qy 663 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 722 

M II II II II M II M M M II II II M I M M M II II II M II M II II II II I M II 

Db 398 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 457 

Qy 723 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 782 

I M M II II M M II II II II M M M M II II M II M II II II II II II II II II I II 

Db 458 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 517 

Qy 7 83 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 842 

I I I I M M II II M M II M II II II M I II II M M II M II II II II II II I Mill 

Db 518 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 577 

Qy 843 TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II II I III I II M I III 
Db 578 AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 



RESULT 14 
AB001576 

LOCUS 7VB001576 2188 bp mRNA linear ROD 13-FEB-1999 

DEFINITION Rattus sp . mRNA for NTAK alpha2-lp, partial cds . 

ACCESSION AB001576 

VERSION AB001576.1 GI:2605478 

KEYWORDS neural- and thymus-derived activator for ErbB kinases. 



SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



Rattus sp. 
Rattus sp . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 (sites) 

Higashiyama,S., Horikawa,M. , Yamada,K., Ichino^N., Nakano,N., 
Nakagawa,T., Miyagawa,J., Matsushita, N . , Nagatsu,T., Taniguchi.N. 
and Ishiguro,H. 

A novel brain-derived member of the epidermal growth factor family 

that interacts with ErbB3 and ErbB4 

J. Biochem. 122 (3), 675-680 (1997) 

98006324 

9348101 

2 (bases 1 to 2188) 
Ishiguro,H. 

Direct Submission 

Submitted ( 04-MAR-1997 ) Hiroshi Ishiguro, Fujita Health University, 

ICMS; 1-98, kutsukake-cho, Toyoake, Aichi 470-11, Japan 

(E-mail :hishi(afujita-hu. ac.jp, Tel : 0562-93-9393 , Fax : 0562-93-8831 ) 

Location/Qualifiers 

1. .2188 

/ organism="Rattus sp . " 

/mol_type="mRNA" 

/db_xref-"taxon: 10118" 

/cell_line="PC12" 

/ cell_type="pheochromocytoma " 

<1. .1926 

/note="neural- and thymus -derived activator for ErbB 

kinases (NTAK) ; a member of the epidermal growth factor 

(EGF) family" 

/ codon_start=l 

/product="NTAK alpha2-lp" 

/protein_id="BAA23348. 1" 

/db_xref="GI;2605479" 

/translation="FFFFKTAFAPVDPNGKNIKKEVGKILCTDCATRPKLKKMKSQTG 
EVGEKQSLKCEAAAGNPQPSYRWFKDGKELNRSRDIRIKYGNGRKNSRLQFNKVKVED 
AGE YVCEAEN I LGKDT VRGRLHVNS VSTTLS S WS GHARKCNETAKS YCVNGGVC YYI E 
GINQLSCKCPNGFFGQRCLEKLPLRLYMPDPKQKAEELYQKRVLTITGICVALLWGI 
VCWAYCKTKKQRRQMHHHLRQNMCPAHQNRSLANGPSHPRLDPEEIQMADYISKNVP 
ATDHVIRREAETTFSGSHSCSPSHHCSTATPTSSHRHESHTWSLERSESLTSDSQSGI 
MLSSVGTSKCNSPACVEARARRAAAYSQEERRRAAMPPYHDSIDSLRDSPHSERYVSA 
LTTPARLSPVDFHYSLATQVPTFEITSPNSAHAVSLPPAAPISYRLAEQQPLLRHPAP 
PGPGPGPGADMQRSYDSYYYPAAGPGPRRGACALGGSLGSLPASPFRIPEDDEYETTQ 
ECAPPPPPRPRTRGASRRTSAGPRRWRRSRLNGLAAQRARAARDSLSLSSGSGCGSAS 
ASDDDADDADGALAAESTPFLGLRAAHDALRSDSPPLCPAADSRTYYSLDSHSTRASS 
RHSRGPPTRAKQDSGPL" 



ORIGIN 



Query Match 47.4%; Score 425; DB 10; Length 2188; 

Best Local Similarity 87.0%; Pred. No. 4.2e-71; 

Matches 467; Conservative 0; Mismatches 70; Indels 0; Gaps 0; 

Qy 337 TTAGTCTTTAAGACGGCCTTTGCCCCCCTCGATACCAACGGCT^AAAATCTCAAGAAAGAG 396 

II I II II I I II I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I M I I 
4 TTTTTTTTTAAGACAGCCTTTGCCCCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAG 63 



Qy 

Db 


397 
64 


GTGGGCAAGATCCTGTGCACTGACTGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGC 
1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 
GTGGGCAAGATCCTGTGCACTGACTGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGT 


456 
123 


Qy 


457 


CAGACGGGACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II II II M Ml 
CAGACAGGAGAGGTGGGCGAGAAGCAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCC 


516 


Db 


124 


183 


Qy 


517 


CAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGC 
Mill 1 1 1 1 1 II 1 II II 1 1 M 1 1 1 1 II M 1 1 1 1 1 1 1 II 1 1 M II 1 1 1 1 II 1 1 1 
CAGCCCTCCTATCGATGGTTCAAGGACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGC 


576 


Db 


184 


243 


Qy 


577 


ATCAAATATGGCAACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTGAAGGTGGAG 
Mill M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
ATCAAGTATGGCAACGGCAGAAAGAACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAG 


636 


Db 


244 


303 


Qy 


637 


GACGCTGGGGAGTATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGC 

M M II II 1 1 11 1 Mill 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 M 1 1 II 1 M 1 1 1 1 1 

GACGCTGGAGAGTACGTCTGTGAGGCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGC 


696 


Db 


304 


363 


Qy 


697 


CGGCTTTACGTCAACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAG 

Mill 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 M 

CGGCTCCATGTCAACAGTGTGAGCACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAG 


756 


Db 


364 


423 


Qy 


757 


TGCAACGAGACAGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGC 

1 1 1 II 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 It II II 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 Ml 
> > ■ 1 1 1 1 1 1 1 [ t 1 1 1 1 1 1 1 1 1 1 II II i 1 M 1 1 1 1 1 It 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

TGCAATGAGACAGCCAAGTCCTACTGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGC 


816 


Db 


424 


483 


Qy 


817 


ATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 
M II 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 III 
ATCAACCAACTCTCCTGCAAATGTCCAAACGGATTCTTCGGACAGAGATGTTTGGAG 540 




Db 


484 





RESULT 15 

AC094080 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



AC094080 118504 bp DNA linear PRI 27-MAR-2002 

Homo sapiens chromosome 5 clone CTB-77K22, complete sequence. 
AC094080 

AC094 080.4 GI: 19747152 
HTG. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 118504) 

DOE Joint Genome Institute and Stanford Human Genome Center. 

Direct Submission 

Unpublished 

2 (bases 1 to 118504) 

DOE Joint Genome Institute. 
Direct Submission 

Submitted (14-SEP-2001) Production Sequencing Facility, DOE Joint 
Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA 

3 (bases 1 to 118504) 

DOE Joint Genome Institute. 
Direct Submission 

Submitted ( 07-MAR-2002 ) Production Sequencing Facility, DOE Joint 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA 
4 (bases 1 to 118504) 

DOE Joint Genome Institute and Stanford Human Genome Center. 
Direct Submission 

Submitted {27-MAR-2002 ) DOE Joint Genome Institute, 2800 Mitchell 
Drive, Walnut Creek, CA 94598, USA 

On Mar 27, 2002 this sequence version replaced gi: 19224838. 
Draft Sequence Produced by DOE Joint Genome Institute 
www, j gi . doe , gov 

Finishing Completed at Stanford Human Genome Center 
www-shgc. Stanford. edu 

Quality: Phrap Quality >^40 99.7% of Sequence; 
Estimated Total Number of Errors is 0.5. 

Location/Qualifiers 

1. .118504 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref="taxon: 9606" 
/chromosome="5" 
/clone="CTB-77K22" 



ORIGIN 



Query Match 47.4%; Score 424.8; DB 9; Length 118504; 

Best Local Similarity 94.2%; Pred. No. 3.5e-71; 

Matches 452; Conservative 0; Mismatches 27; Indels 1; Gaps 1; 



Qy 


1 


ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 
1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 


60 


Db 


80890 


80949 


Qy 


61 


TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 

M 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 


120 


Db 


80950 


81009 


Qy 


121 


GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 

1 1 1 1 II M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M M 1 1 1 1 

GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 


180 


Db 


81010 


81069 


Qy 


181 


CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 

II 1 1 1 1 1 1 1 M M M M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 


240 


Db 


81070 


81129 


Qy 


241 


GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M M 

GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 


300 


Db 


81130 


81189 


Qy 

Db 


301 
81190 


CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 
1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 { 1 1 1 1 1 1 1 1 1 { 1 1 { 

CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 


360 
81249 


Qy 


361 


CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 

1 1 N 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 


420 


Db 


81250 


81309 


Qy 


421 


TGC-GCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAA 

nil 1 1 1 III 1 1 1 1 1 M 1 1 1 1 1 

TGCGGTGAGTCGCCCCCTCCCTTTGCTGGAGAAAGGGGGGAGGGGCGAGGTGGTGGAGAA 


479 


Db 


81310 


81369 



I 



Search completed: August 15, 2004, 08:02:13 
Job time : 3832.5 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



August 15, 2004, 02:37:58 ; Search time 430.237 Seconds 

(without alignments) 
8857.048 Million cell updates/sec 

US-09-864-675-3 
897 

1 atgaggcgcgacccggcccc caatggtcaacttctcctaa 897 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 3373863 seqs, 2124099041 residues 

Total number of hits satisfying chosen parameters; 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



6747726 



Database 



N_Geneseq_29 Jan04 : * 

1: geneseqnl980s : * 

2 : geneseqnl990s : * 

3: geneseqn2000s : ^ 

4 : geneseqn2001as : * 

5 : geneseqn2001bs : * 

6: geneseqn2002s : * 

7: geneseqn2003as : * 

8 : geneseqn2003bs : * 

9 : geneseqn2003cs : * 

10: geneseqn2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 



Query 



^o. 


Score 


Match 


Length 


DB 


ID 


1 


897 


100.0 


897 


6 


AAS18020 


2 


849 


94.6 


994 


6 


AAS18019 


3 


835.4 


93.1 


1884 


2 


AAV17814 


4 


784 


87.4 


3441 


2 


AAT87922 


5 


738.6 


82.3 


1803 


2 


AAT87923 


6 


737 


82.2 


3076 


2 


AAV43674 


7 


723.4 


80.6 


1863 


7 


ABS56035 



Description 



Aasl8020 Human cDN 
Aasl8 019 Human cDN 
Aavl7814 Homo sapi 
Aat87922 Rat cereb 
Aat87923 Rat cereb 
Aav43674 Receptor 
Abs56035 cDNA enco 



8 


427.8 


47, 


7 


2268 


2 


AAV17816 


Aavl7816 


Homo sapi 


9 


426. 8 


47, 


6 


1474 


7 


ABS56036 


Abs56036 


cDNA enco 


10 


426. 8 


47, 


6 


2266 


7 


ABS56045 


Abs56045 


cDNA enco 


11 


426.2 


47 . 


5 


1476 


2 


AAV17815 


Aavl7815 


Homo sapi 


12 


424 


47 , 


3 


1054 


6 


ABL40993 


Abl40993 


Human neu 


13 


405, 4 


45. 


2 


1607 


2 


AAV17813 


Aavl7813 


Mus muscu 


14 


402 .2 


44 . 


8 


2467 


2 


AAV17812 


Aavl7812 


Mus muscu 


15 


393.4 


43. 


9 


1561 


7 


ABS56034 


Abs56034 


cDNA enco 


16 


393.4 


43. 


9 


2442 


7 


ABS56033 


Abs56033 


cDNA enco 


17 


256 


28 . 


5 


667 


2 


AAT87924 


Aat87924 


Human cejr 


18 


173 


19, 


3 


419 


6 


ABL40994 


Abl40994 


HiiTnsn ne'U 


19 


124 . 6 


13 , 


9 


480 


6 


ABL40995 


Abl40995 


HuTn^^n nf^u 

1 1 I Id 11 11 ^ ^ 


20 


124 


13. 


8 


350 


6 


ABL40996 


Abl40996 


Human neu 


21 


85 . 6 


9 . 


5 


1986 


2 


AAZ32 061 


Aa z 3 2 0 6 1 


Human MET 

A X L^XLL^l XX X XX-J X 


22 


84 


9. 


4 


1986 


5 


AAC90318 


Aac90318 


L12260 cD 


23 


84 


9. 


4 


2003 


2 


AAT30995 


Aat30995 


Glial qjTo 


24 


84 


9. 


4 


2003 


2 


AAT48090 


Aat48090 


Human neu 


25 


84 


9, 


4 


2003 


2 


AAT06739 


Aat06739 


Glial qro 


26 


84 


9, 


4 


2003 


2 


AAZ32062 


Aaz32062 


Human MET 

X X K^X L LCL XX XX X-J X 


27 


84 


9 . 


4 


2003 


5 


AAC90319 


Aac90319 


136352 cD 


28 


83.4 


9, 


3 


1108 


2 


AAQ30671 


Aaq30671 


GGF2BPP3 


29 


83.4 


9, 


3 


1108 


2 


AAQ62850 


Aaa62850 




30 


83.4 


9. 


3 


1108 


2 


AAQ58304 


Aaq5 8304 


GGF-II cD 


31 


83.4 


9. 


3 


1108 


2 


AAQ74886 


Aaq74 8 86 


Putative 


32 


83 . 4 


9 . 


3 


1108 


2 


AAT30998 


Aat30998 


Rovi ne^ nl 

U V V J_ 11^ iJ ~'~ 


33 


83 . 4 


9, 


3 


1108 


2 


AAT48080 


Aat4 8080 


Bovi T\& Y\& 

X^ Vm/ V X X X X 


34 


83,4 


9. 


3 


1108 


2 


AAT06704 


Aat06704 


Bovine ql 


35 


82.4 


9. 


2 


2003 


2 


AAQ62838 


Aaq628 38 


GGF2HBS5 . 


36 


82.4 


9. 


2 


2003 


2 


AAQ58329 


Aaq58329 


GGF2HBS5 . 


37 


78.2 


8 . 


7 


1027 


5 


AAF80062 


Aaf 80062 


Nucl eot id. 


38 


78.2 


8 . 


7 


3086 


5 


AAF80059 


Aa f 8 0 0 5 9 


Nu c 1 f*o"l" i c3 

li l-A Vih/ JL 


39 


76.8 


8 , 


6 


1140 


2 


AAQ62840 


Aaq62840 


GGF2BPP2 . 


40 


76 . 8 


8 . 


6 


1140 


2 


AAQ58321 


J^rd ^ \J ^ ^ ^ 


GGF^RPP? 


41 


76.8 


8. 


6 


1140 


2 


AAQ74912 


Aaq74912 


Bovine gl 


42 


76,8 


8, 


6 


1140 


2 


AAT31001 


Aat31001 


Glial gro 


43 


76.8 


8. 


6 


1140 


2 


AAT48088 


Aat48088 


Human neu 


44 


76,8 


8, 


6 


1140 


2 


AAT06731 


Aat06731 


BPP2 glia 


45 


76.8 


8. 


6 


1140 


2 


AAX81201 


Aax81201 


Nucleotid 



7iJ.IGNMENTS 



RESULT 1 
7\AS18020 

ID AAS18020 standard; cDNA; 897 BP. 
XX 

AC AAS18020; 
XX 

DT 12-MAR-2002 (first entry) 
XX 

DE Human cDNA encoding Neuregulin-2beta, NRG-2beta, 
XX 

KW Human; ss; neuregulin-2 ; NRG-2alpha; NRG-2beta; mitogenesis; 

KW cell survival; cell growth; cell differentiation; erbB receptor; 

KW cardiomyopathy; ischaemic damage; cardiac trauma; heart failure; 

KW atherosclerosis; vascular lesion; vascular hypertension; 



KW degenerative congenital vascular disease; myasthenia gravis; 

KW neurodegenerative disorder; peripheral neuropathy; 

KW sensory nerve fiber neuropathy; motor fiber neuropathy; 

KW sensory nerve fiber neuropathy; multiple sclerosis; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; nerve injury; 

KW Alzheimer's disease; Parkinson's disease; cerebellar ataxia; 

KW spinal cord injury; tumour; neurofibromatosis; transgenic animal. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualif iers 

FT CDS 1. .8 97 

FT /*tag= a 

FT /product= "NRG-2beta" 

XX 

PN WO200189568-A1. 
XX 

PD 29-NOV-2001. 
XX 

PF 23-MAY-2001; 2001WO-US016896 . 
XX 

PR 23-MAY-2000; 2000US-02 06495P . 
XX 

PA (CENE-) CENES PHARM INC. 
XX 

PI Marchionni MA; 
XX 

DR WPI; 2002-097612/13. 

DR P-PSDB; AAU11636. 
XX 

PT Neuregulin-2 polypeptide and polynucleotide useful for treating multiple 

PT sclerosis, spinal muscular atrophy, nerve injury, Alzheimer's disease, by 

PT increasing mitogenesis, survival, growth or differentiation of a cell. 
XX 

PS Claim 57; Fig 8; 79pp; English. 
XX 

CC The invention relates to a substantially pure neuregulin (NRG) -2 

CC polypeptide comprising or consisting of a sequence for human NRG-2alpha 

CC or NRG-2beta (clone 2b7) and the polynucleotides encoding the. Also 

CC included are a vector expressing the protein, a host cell comprising the 

CC vector, a transgenic non-human animal transformed with the vector or 

CC having a knockout mutation in one or both NRG-2 alleles and an anti-NRG-2 

CC antibody. Analysis of mutations in NRG-2 in an individual is useful for 

CC diagnosing an increased likelihood of developing a NRG-2-related disease 

CC or condition in a test subject. NRG-2 is useful for increasing the 

CC mitogenesis, survival, growth or differentiation of a cell (e.g. a 

CC neuronal cell), where the cell expresses an erbB receptor. NRG-2 is 

CC useful for treating diseases and disorders such as cardiomyopathy 

CC (preferably degenerative congenital disease) , ischaemic damage, cardiac 

CC trauma or heart failure or which has a condition affecting smooth muscle 

CC which include atherosclerosis, vascular lesion, vascular hypertension, 

CC and degenerative congenital vascular disease, myasthenia gravis, a 

CC neurodegenerative disorder, peripheral neuropathy, a sensory nerve fiber 

CC neuropathy, a motor fiber and a sensory nerve fiber neuropathy, multiple 

CC sclerosis, amyotrophic lateral sclerosis, spinal muscular atrophy, nerve 

CC injury, Alzheimer's disease, Parkinson's disease, cerebellar ataxia, and 

CC spinal cord injury. The antibody is useful for treatment of a tumour 



CC comprising inhibiting proliferation of a tumour cell preferably a glial 
CC tumour cell, for treating of neurofibromatosis by inhibiting glial cell 
CC mitogenesis. The present sequence encodes NRG-2beta 
XX 

SQ Sequence 897 BP; 200 A; 261 C; 282 G; 154 T; 0 U; 0 Others- 
Query Match 100.0%; Score 897; DB 6; Length 897; 
Best Local Similarity 100.0%; Pred. No. 1.5e-195; 

Matches 897; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

Qy 121 GGC7VAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M M I I I M I I I I I I I I I I I I I I I I 
Db 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCTVACAGCACCCGAGAGCCG 180 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I 
Db 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 240 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGT^CAGCCCTTAGTCTTTAAGACGGCCTTTGCC 3 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I 
Db 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

I I I M M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I 
Db 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 480 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I M M I I I I I I I I 
Db 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I M I I I I 
Db 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 



Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

I I I I I I I I I I I I I I I I I I I M I I I M I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCT^GTCCTAT 780 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 84 0 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCTAA 897 

I M I I ! I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
Db 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCTAA 897 



RESULT 2 
AAS18019 

ID AAS18019 standard; cDNA; 994 BP. 
XX 

AC AAS18019; 
XX 

DT 12-M7VR-2002 (first entry) 
XX 

DE Human cDNA encoding Neuregulin-2alpha, NRG-2alpha, 
XX 

KW Human; ss; neuregulin-2 ; NRG-2alpha; NRG-2beta; mitogenesis; 

KW cell survival; cell growth; cell differentiation; erbB receptor; 

KW cardiomyopathy; ischaemic damage; cardiac trauma; heart failure; 

KW atherosclerosis; vascular lesion; vascular hypertension; 

KW degenerative congenital vascular disease; myasthenia gravis; 

KW neurodegenerative disorder; peripheral neuropathy; 

KW sensory nerve fiber neuropathy; motor fiber neuropathy; 

KW sensory nerve fiber neuropathy; multiple sclerosis; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; nerve injury; 

KW Alzheimer's disease; Parkinson's disease; cerebellar ataxia; 

KW spinal cord injury; tumour; neurofibromatosis; transgenic animal. 

XX 

OS Homo sapiens, 
XX 

FH Key Location/Qualifiers 

FT CDS 1. .993 

FT /*tag= a 

FT /product= "NRG-2alpha" 

XX 

PN WO200189568-A1. 
XX 

PD 29-NOV-2001. 
XX 

PF 23-MAY-2001; 2001WO-US016896 . 
XX 

PR 23-MAY-2000; 2000US-02064 95P . 
XX 

PA (CENE-) CENES PHARM INC. 
XX 

PI Marchionni MA; 
XX 

DR WPI; 2002-097612/13. 

DR P-PSDB; 7^U11635. 



PT Neuregulin-2 polypeptide and polynucleotide useful for treating multiple 

PT sclerosis, spinal muscular atrophy, nerve injury, Alzheimer's disease, by 

PT increasing mitogenesis, survival, growth or differentiation of a cell. 
XX 

PS Claim 57; Fig 6; 79pp; English. 
XX 

CC The invention relates to a substantially pure neuregulin (NRG) -2 

CC polypeptide comprising or consisting of a sequence for human NRG-2alpha 

CC or NRG-2beta (clone 2b7) and the polynucleotides encoding the. Also 

CC included are a vector expressing the protein, a host cell comprising the 

CC vector, a transgenic non-human animal transformed with the vector or 

CC having a knockout mutation in one or both NRG-2 alleles and an anti-NRG-2 

CC antibody. Analysis of mutations in NRG-2 in an individual is useful for 

CC diagnosing an increased likelihood of developing a NRG-2-related disease 

CC or condition in a test subject. NRG-2 is useful for increasing the 

CC mitogenesis, survival, growth or differentiation of a cell (e.g. a 

CC neuronal cell) , where the cell expresses an erbB receptor. NRG-2 is 

CC useful for treating diseases and disorders such as cardiomyopathy 

CC (preferably degenerative congenital disease) , ischaemic damage, cardiac 

CC trauma or heart failure or which has a condition affecting smooth muscle 

CC which include atherosclerosis, vascular lesion, vascular hypertension, 

CC and degenerative congenital vascular disease, myasthenia gravis, a 

CC neurodegenerative disorder, peripheral neuropathy, a sensory nerve fiber 

CC neuropathy, a motor fiber and a sensory nerve fiber neuropathy, multiple 

CC sclerosis, amyotrophic lateral sclerosis, spinal muscular atrophy, nerve 

CC injury, Alzheimer's disease, Parkinson's disease, cerebellar ataxia, and 

CC spinal cord injury. The antibody is useful for treatment of a tumour 

CC comprising inhibiting proliferation of a tumour cell preferably a glial 

CC tumour cell, for treating of neurofibromatosis by inhibiting glial cell 

CC mitogenesis. The present sequence encodes NRG-2alpha 

XX 

SQ Sequence 994 BP; 230 A; 279 C; 304 G; 181 T; 0 U; 0 Other; 

Query Match 94.6%; Score 849; DB 6; Length 994; 

Best Local Similarity 98.3%; Pred. No. 1.5e-184; 

Matches 858; Conservative 0; Mismatches 15; Indels 0; Gaps 0 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I M M I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 240 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 30 0 

I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 



Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I I I I M I I I I M I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCT^GAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 480 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I M I I I I I I I I I 
Db 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I 
Db 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

I I I I I I M I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 7 80 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 7 80 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATC7UVCCAGCTCTCCTGCAAGTGT 840 

I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 840 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II I I I I I III MINI III 

Db 841 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 873 



RESULT 3 


AAV17814 


ID 


AAV17814 standard; cDNA; 1884 BP. 


XX 




AC 


AAV17814; 


XX 




DT 


17-AUG-1998 (first entry) 


XX 




DE 


Homo sapiens don-1 gene splice variant. 


XX 




KW 


Murine; don-1 gene; melanoma; treatment; adenocarcinoma; epithelial cell; 


KW 


proliferation; stimulation; treatment; tumours; skin; oesophagus; lung; 


KW 


breast; liver; pancreas; colon; prostate; gastrointestinal tract; uterus; 


KW 


wound healing; transmembrane; ss. 


XX 





OS Homo sapiens. 
XX 

FH Key 

FT CDS 
FT 
FT 
XX 

PN WO9807736-A1. 
XX 

PD 26-FEB-1998. 
XX 

PF 18-AUG-1997; 
XX 

PR 19-AUG-1996; 

PR 19-NOV-1996; 
XX 

PA (MILL-) MILLENNIUM BIOTHERAPEUTICS INC. 
XX 

PI Gearing DP, Busfield SJ; 
XX 

DR WPI; 1998-169084/15. 

DR P-PSDB; AAW48381, 
XX 

PT Mouse and human don-1 polypeptide ( s ) - useful for treatment of melanomas 

PT and adenocarcinoma ( s ) , and for wound healing. 

XX 

PS Claim 4; Fig 3; 121pp; English. 
XX 

CC The sequence is that of a human don-1 gene splice variant. Don-1 

CC polypeptides stimulate proliferation of epithelial cells and thus are 

CC implicated in melanomas and adenocarcinomas in which epithelial cells 

CC proliferate out of control. Compounds that interfere with don-1 mediated 

CC cell proliferation can be used in the treatment of tumours such as 

CC melanomas and adenocarcinomas of the skin, oesophagus, lung, breast, 

CC liver, pancreas, gastrointestinal tract, colon, prostate or uterus. 

CC Alternatively, don-1 polypeptides can be used to stimulate epithelial 

CC cell proliferation, e.g. for wound healing 

XX 

SQ Sequence 1884 BP; 426 A; 607 C; 560 G; 291 T; 0 U; 0 Other; 

Query Match 93.1%; Score 835.4; DB 2; Length 1884; 
Best Local Similarity 98.1%; Pred. No. 2.2e-181; 

Matches 856; Conservative 0; Mismatches 16; Indels 1; Gaps 1; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I M I I I I I M I I I M I I M I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

Db 218 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 277 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 278 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 337 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

I I I I I I I I I I I I I I I I I M M I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I M 

Db 338 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 397 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 



Location/ Qualifiers 
664. .1884 
/^tag= a 

/note= "don-1 polypeptide" 



97WO-US014585. 

96US-00699591. 
96US-00753007. 



I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 398 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACT^AGTGGCCGGTCCGGAGCGGG 457 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 3 00 

M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 458 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 517 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I M I I I 
Db 518 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 577 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

I I I I I M I I I I I I I I I I I I I M M I I I I I I I I I I I I I I M I I I I I I M I I I I I I I M I 

Db 578 CCCCT-GATACCAACGGCAAAAATCTCTJy^^GAAAGAGGTGGGCAAGATCCTGTGCACTGGC 636 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 637 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 696 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

I I I M I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 697 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 756 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 757 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 816 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I M I I I M I I I I I I I I I I I I I I I I I I I M 
Db 817 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 876 

Qy 661 GCCGAG7\ACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 72 0 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I M I I I I M I I I I 
Db 877 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 936 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 937 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 996 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 84 0 

I I I I I I I I { I I I M I I I M I M I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I III 

Db 997 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 1056 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II I I I I I III I I I I I I III 

Db 1057 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 1089 



RESULT 4 
AAT87922 

ID AAT87922 standard; cDNA; 3441 BP. 
XX 

AC AAT87922; 
XX 

DT 18-DEC-1997 (first entry) 
XX 

DE Rat cerebellum derived growth factor 1 cDNA. 



XX 

KW Rat; cerebellum derived growth factor; CDGFl; screening; binding; 

KW modulation; erbB type receptor; identification; indication; risk; 

KW proliferation; differentiation; induction; neuron; hyperplasia; 

KW stem cell culture; intracerebral graft; alleviation; repair; 

KW behavioural defect; nervous system; central; peripheral; nerve; 

KW prothesis; damage; entubulation; cell survival; treatment; injury; 

KW trauma; ischaemia; ischemia; stroke; infection; disorder; inflammation; 

KW neurodegeneration; disease; Parkinson* s; Huntingdon's; 

KW amylotrophic lateral sclerosis; sensory; retina; 

KW spinocerebellar degeneration; multiple sclerosis; neoplasia; 

KW amalignant glioma; medulloblastoma ; neuroectodermal tumour; ds . 
XX 

OS Rattus rattus , 
XX 

FH Key 

FT CDS 
FT 

FT sig_peptide 
FT 

FT mat_peptide 
FT 
FT 
XX 

PN WO9709425-A1, 
XX 

PD 13-MAR-1997. 
XX 

PF 09-SEP-1996; 
XX 

PR 08-SEP-1995; 
XX 

PA (HARD ) HARVARD COLLEGE. 

PA (STRD ) UNIV LELAND S STANFORD. 

XX 

PI Chang H; 
XX 

DR WPI; 1997-192900/17. 

DR P-PSDB; 7UVW27536, 
XX 

PT Rat and human cerebellum-derived growth factors - used in the treatment 

PT of neuronal injury and proliferative disorders. 

XX 

PS Claim 17; Page 63-66; 94pp; English. 
XX 

CC The present sequence encodes rat cerebellum derived growth factor 1 

CC (CDGFl), which can be used to screen for modulators of CDGF binding to 

CC erbB type receptors. Identification of a modification or mutation in a 

CC CDGF gene, or aberrant expression of a CDGF gene or levels of soluble 

CC CDGF may be used to indicate the risk of unwanted cell proliferation or 

CC differentiation. CDGF may be used to induce neuronal differentiation in 

CC stem cell culture, and maintain the integrity of a terminally 

CC differentiated neuronal cell culture, e.g. useful for intracerebral 

CC grafting to alleviate behavioural defects. CDGF may also be used in nerve 

CC protheses to repair central and peripheral nerve damage, especially where 

CC a crushed or severed axon is entubulated by a prosthetic. CDGF may also 

CC be used to enhance neuronal cell survival in the central or peripheral 



Location/ Qualifiers 

180. ,2444 

/*tag= a 

180. .248 

/*tag= b 

249. .2441 

/*tag= c 

/product= "cerebellum_derived_growth_f actor" 



96WO-US014484 . 
95US-00525864, 



CC nervous system, to treat neurological conditions associated with nervous 

CC system injury, e.g. traumatic, chemical or vasal injury and deficits such 

CC as ischaemia resulting from stroke, infectious/inflammatory and tumour 

CC induced injury, chronic neurodegenerative disease including Parkinson's 

CC and Huntingdon's, amylotrophic lateral sclerosis, spinocerebellar 

CC degeneration, chronic immunological disease of the nervous system 

CC including multiple sclerosis, disorders of the sensory neurons and 

CC degenerative diseases of the retina, CDGF may also be used to treat 

CC neoplastic or hyperplastic transformations, particularly of the central 

CC nervous system, e.g. amalignant gliomas, medulloblastomas and 

CC neuroectodermal tumours 

XX 

SQ Sequence 3441 BP; 777 A; 1057 C; 1015 G; 592 T; 0 U; 0 Other; 

Query Match 87.4%; Score 784; DB 2; Length 3441; 

Best Local Similarity 92.2%; Pred. No. 1.4e-169; 

Matches 826; Conservative 0; Mismatches 70; Indels 0; Gaps 0; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I M 
Db 180 ATGAGGCGCGACCCGGCCCCCGGCTTCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 239 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACT^GGCACCCGTGGTGGTGGAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 240 TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 299 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 

I I I I I I I I I I I I I I MM Ml M M M I M M M M M M M M M M M M I 

Db 300 GGCAAGGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTT^CAGCACCCGAGAGCCT 359 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

M I M M M M M M M I M I MM M M M M M M M M M M M M M M M M I 

Db 360 CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 419 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

M M M I M M M M M M M M M M M M M M M I I M M M M M M M M M I 

Db 420 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCGAAAGGAACCAG 479 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

M M M M M M M M I M M M M I M M M M I M M M M M M M M M M I 

Db 4 80 CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 539 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

II II I I I II I M I I II M I II I II I I II I II M I I I I II II II I M M I II M 

Db 540 CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 599 

Qy 421 TGCGCCACCCGGCCCT^AGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 480 

I II I I I I II II M I II I I I II I M I I I II I M I I M M III I I II I I I I I I I I I 

Db 600 TGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGAAG 659 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 

II Mill I M II I I I I M II II M M M II II M II M I II II M I I I M 

Db 660 CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 719 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 60 0 

II M I I I M I I I M II II I II II I I I I M II II M II II II I M II II M II I I I 

Db 720 GACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGCATCAAGTATGGCAACGGCAGAAAG 779 



Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I Mill I I I I I III 

Db 78 0 AACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 839 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

II I I I II I I I I II I I I I I I I I I II II I I I I I I M I I I I II II I I I I II I I I 

Db 84 0 GCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 899 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGG7\AGTGCAACGAGACAGCCAAGTCCTAT 7 80 

Mill Mill I M I I II II M I II I II I I II II I I II I I I I II I M I I I I II M I I 

Db 90 0 ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 959 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

II II M II II I II II I M II I II II I II I II I I I M II M M II I I I I M I II M 

Db 960 TGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGCATCAACCAACTCTCCTGCAAGTGT 1019 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCTA 896 

M I II M I M II I II I I II M II I II I II M M II I II I I I M II II II II I II I 

Db 1020 CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCAA 1075 



RESULT 5 
AAT87923 
ID 
XX 



AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 



AAT87923 standard; cDNA; 1803 BP. 
AAT87923; 

18-DEC-1997 (first entry) 

Rat cerebellum derived growth factor 2 cDNA. 

Rat; cerebellum derived growth factor; CDGF2 ; screening; binding; 
modulation; erbB type receptor; identification; indication; risk; 
proliferation; differentiation; induction; neuron; hyperplasia; 
stem cell culture; intracerebral graft; alleviation; repair; 
behavioural defect; nervous system; central; peripheral; nerve; 
prothesis; damage; entubulation; cell survival; treatment; injury; 
trauma; ischaemia; ischemia; stroke; infection; disorder; inflammation; 
neurodegeneration; disease; Parkinson* s; Huntingdon* s; 
amylotrophic lateral sclerosis; sensory; retina; 
spinocerebellar degeneration; multiple sclerosis; neoplasia; 
amalignant glioma; medulloblastoma; neuroectodermal tumour; ds . 



Rattus rattus , 

Key 
CDS 

sig_peptide 
mat_peptide 

WO9709425-A1. 



Location/ Qualifiers 
1. .993 
/■^tag= a 
1. .69 
/*tag= b 
70. .990 
/*tag= c 
/product= 



"cerebellum derived_growth_f actor " 



PD 13-MAR-1997. 
XX 

PF 09-SEP-1996; 96WO-US0144 84 . 
XX 

PR 08-SEP-1995; 95US-005258 64 . 
XX 

PA (HARD ) HARVARD COLLEGE, 

PA (STRD ) UNIV LELAND S STANFORD. 

XX 

PI Chang H; 
XX 

DR WPI; 1997-192900/17. 

DR P-PSDB; AAW27537. 
XX 

PT Rat and human cerebellum-derived growth factors - used in the treatment 

PT of neuronal injury and proliferative disorders. 

XX 

PS Claim 17; Page 70-74; 94pp; English. 
XX 

CC The present sequence encodes rat cerebellum derived growth factor 2 

CC (CDGF2), which can be used to screen for modulators of CDGF binding to 

CC erbB type receptors. Identification of a modification or mutation in a 

CC CDGF gene, or aberrant expression of a CDGF gene or levels of soluble 

CC CDGF may be used to indicate the risk of unwanted cell proliferation or 

CC differentiation. CDGF may be used to induce neuronal differentiation in 

CC stem cell culture, and maintain the integrity of a terminally 

CC differentiated neuronal cell culture, e.g. useful for intracerebral 

CC grafting to alleviate behavioural defects . CDGF may also be used in nerve 

CC protheses to repair central and peripheral nerve damage, especially where 

CC a crushed or severed axon is entubulated by a prosthetic. CDGF may also 

CC be used to enhance neuronal cell survival in the central or peripheral 

CC nervous system, to treat neurological conditions associated with nervous 

CC system injury, e.g. traumatic, chemical or vasal injury and deficits such 

CC as ischaemia resulting from stroke, infectious/inflammatory and tumour 

CC induced injury, chronic neurodegenerative disease including Parkinson* s 

CC and Huntingdon's, amylotrophic lateral sclerosis, spinocerebellar 

CC degeneration, chronic immunological disease of the nervous system 

CC including multiple sclerosis, disorders of the sensory neurons and 

CC degenerative diseases of the retina. CDGF may also be used to treat 

CC neoplastic or hyperplastic transformations, particularly of the central 

CC nervous system, e.g. amalignant gliomas, medulloblastomas and 

CC neuroectodermal tumours 

XX 

SQ Sequence 1803 BP; 408 A; 549 C; 537 G; 309 T; 0 U; 0 Other; 



Query Match 82.3%; Score 738.6; DB 2; Length 1803; 

Best Local Similarity 90.4%; Pred. No. 3e-159; 

Matches 78 9; Conservative 0; Mismatches 84; Indels 0; Gaps 0; 
Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

M I I I I I I I I I M I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 60 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTAC7\AGGCACCCGTGGTGGTGGAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 61 TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 



Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

I I I I I I I I I I I I I I I I I I III II I I I I I I I I I I I I I II II I I I I I I I I I I II I 
Db 121 GGCAAGGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCT 180 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 240 

I I I I I I I I I I I I I I I I I I I I I MM I II I I II I I I II I I I M I I M II I II I II I II I 

Db 181 CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

II M I II I I M I I I I I II II I I I M I M I I I I I M I I I I I M I II II M II M M I I I 

Db 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCGAT^AGGAACCAG 300 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I II II II II I I I I II I II I I I I I I II M II I I M I M I II II I II I I II I M M 

Db 301 CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 360 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGT^AAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

II MM I II I I II II I II I II I I I I I I II M I I I I I I II M I I I II I I I M I I 

Db 361 CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

I I I I I I I II I I I I I II I I I I I I I II I M I M II Mill Ml I II I M I M M II 

Db 421 TGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGAAG 4 80 

Qy 4 81 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

II I II I I II I II II II I I II II II II II I II II I I I II I M I II I I I I I I 

Db 481 CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 54 0 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

II I M I M II II II M II I II II M M M I I I M II I I M I I II I I M II I I I I I 

Db 541 GACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGCATCAAGTATGGCAACGGCAGAAAG 600 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

M I M I I I I M I I I I I I I I II I I I I I I I II I I I II I II I I I I I I I II I II M III 
Db 601 AACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 660 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

II II I I I I II I I I I I M II I I M I II I M M I M I I I I I I I M M MUM 

Db 661 GCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 720 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

II I I I I I II I II I M II II I II I M II M M I I I I I I I I II I I I M II I I I I I I I I 

Db 721 ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 780 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

II II I I M I II II I I I I I I II I II I I I I I I I II I II II II M I II I I I I II III 
Db 781 TGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGCATCAACCAACTCTCCTGCAAATGT 84 0 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

M. I I M I III I I I I I I III 
Db 841 CCAAACGGATTCTTCGGACAGAGATGTTTGGAG 873 

RESULT 6 
AAV43674 

ID AAV43674 standard; cDNA; 3076 BP. 
XX 



AC 


AAV43674; 




XX 






DT 


29-SEP-1998 ( 


first entry) 


XX 






DE 


Receptor type 


tyrosine kinase ErbB ligand encoding cDNA. 


XX 






KW 


Receptor type 


tyrosine kinase ErbB; ligand; diagnostic agent; 


KW 


nervous disease; cancer; ss. 


XX 






OS 


Rattus sp . 




XX 






FH 


Key 


Location/Qualifiers 


FT 


CDS 


232. .2814 


FT 




/*tag= a 


FT 




/product^ "ligand of receptor type tyrosine kinase ErbB" 


XX 






PN 


JP10179166-A. 




XX 






PD 


07-JUL-1998. 




XX 






PF 


25-DEC-1996; 


96JP-00356998 . 


XX 






PR 


25-DEC-1996; 


96JP-00356998. 


XX 






PA 


(HIGA/) HIGASHIYAMA S, 


XX 






DR 


WPI; 1998-430952/37. 


DR 


P-PSDB; AAW63700. 


XX 






PT 


Gene coding the ligand of the tyrosine kinase ErbB receptor - useful for 


PT 


diagnosing and 


, treating nervous diseases and cancer. 


XX 






PS 


Example; Page 


9-13; 17pp; Japanese. 


XX 






cc 


This cDNA encodes the ligand of receptor type tyrosine kinase ErbB. A 


cc 


prokaryotic or 


eukaryotic host cell transformed by a recombinant vector 


cc 


containing the 


encoding DNA can be used for the recombinant production of 


cc 


the protein. The invention provides a method for inhibiting the formation 


cc 


of the ligand 


of receptor type tyrosine kinase ErbB in an animal using an 


cc 


antibody recognizing the protein. The ligand of the tyrosine kinase ErbB 


cc 


receptor and associated materials can be used for treating or diagnosing 


cc 


nervous diseases and cancers 


XX 






SQ 


Sequence 3076 


BP; 673 A; 996 C; 944 G; 463 T; 0 U; 0 Other; 



Query Match 82.2%; Score 737; DB 2; Length 3076; 

Best Local Similarity 90.3%; Pred. No. 7.9e-159; 

Matches 788; Conservative 0; Mismatches 85; Indels 0; Gaps 0; 



Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I M I I I I I I I I I I I II I I I I I 
Db 556 ATGAGGCGCGACCCGGCCCCCGGCTCCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 615 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

I I I I I I I I I I I I I I I I I I II I I I I I I I II I M I I M I M I I I I I I I I I I I I I I II I I I I 
Db 616 TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 675 



Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 

I I I I I I I I I M I I I I I I I III II I I I I I I M I I I I I I I II I I Ml I I I I I I II 
Db 676 GGCAAGGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCT 735 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I II M II I I I I I I I I I I I I I I 

Db 736 CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 7 95 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 796 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCGAAAGGAACCAG 855 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I II I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I II II 

Db 856 CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 915 

Qy 361 CCCCTCGATACC7VACGGCAAAAATCTCAAGAAAGAGGTGGGC7\AGATCCTGTGCACTGAC 42 0 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II 

Db 916 CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 975 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I III I I I I I I I I I I I I I 

Db 976 TGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGAAG 1035 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

II I I I I I I I I I I II I I M II II II II I I I I I I I I I I I I I II M I I I I I I I 

Db 1036 CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 1095 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

II II M I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II 
Db 1096 GACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGCATCAAGTATGGCAACGGCAGAAAG 1155 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M II Ml 

Db 1156 AACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 1215 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 72 0 

II I I II I I I I I I I I I I I I I I I I I I II I I II I M I M I II II I I II I II I II 

Db 1216 GCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 1275 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 7 80 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 1276 ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 1335 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

II II II M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I III 

Db 1336 TGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGCATCAACCAACTCTCCTGCAAATGT 1395 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II I I I I I III I I I I I I IN 

Db 1396 CCAAACGGATTCTTCGGACAGAGATGTTTGGAG 1428 



RESULT 7 
ABS56035 

ID ABS56035 standard; cDNA; 1863 BP. 
XX 



AC ABS56035; 
XX 

DT 14-JAN-2003 (first entry) 
XX 

DE cDNA encoding human membrane-bound splice variant of Don-1. 
XX 

KW Human; Don-1; epidermal growth factor; EGF; neuregulin; 

KW glycoprotein ligand; cell proliferation; cell proliferative disorder; 

KW carcinoma; adenocarcinoma cell; myeloma; cell differentiation; 

KW cell survival; epithelial cell; wound healing; tumour formation; brain; 

KW vulnerary; cytostatic; gene therapy; gene; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 643, .1863 

FT /^tag= a 

FT /partial 

FT /product- "Membrane-bound splice variant of Don-1" 

FT /note= "This sequence lacks a stop codon" 

XX 

PN US2002127594-A1. 
XX 

PD 12-SEP-2002. 
XX 

PF 12-MAR-2002; : 2002US-00096241 . 
XX 

PR 22-JUN-2000; 2000US-00599789 . 
XX 

PA (GEAR/) GEARING D P. 

PA (BUSF/) BUSFIELD S J. 
XX 

PI Gearing DP, Busfield SJ; 
XX 

DR WPI; 2003-039584/03. 

DR P-PSDB; MG71638. 
XX 

PT Novel Don-1 polypeptide useful for stimulating proliferation of cells, 

PT for identifying proteins that interact with Don-1, and for regulating 

PT tumor formation and progression in brain. 
XX 

PS Claim 4; Fig 3; 66pp; English. 
XX 

CC The present invention relates to the isolation of a novel gene called Don 

CC -1, and alternate splice variants of Don-1, which are related to 

CC epidermal growth factors (EGF) such as neuregulins . Don-1 polypeptides 

CC are glycoprotein ligands. Both murine and human Don-1 sequences are 

CC cloned. The mouse Don-1 gene maps to chromosome 18. Don-1 polypeptides 

CC are useful for stimulating proliferation of a cell. Antibodies to Don-1 

CC polypeptides are useful for detecting Don-1 in a sample. The Don-1 

CC polypeptides are useful for treating and diagnosing cell proliferative 

CC disorders and play a role in the proliferation of carcinomas e.g. 

CC adenocarcinoma, myeloma, in cell differentiation, proliferation and 

CC survival. The polypeptides are also useful for inhibiting proliferation 

CC of adenocarcinoma cells, for stimulating the proliferation of cells such 

CC as epithelial cells to promote wound healing, for identifying proteins 

CC that interact with Don-1, and for regulating tumour formation and 



CC progression in the brain. The polynucleotide sequences encoding Don-1 may 

CC be used in gene therapy. The present sequence encodes human membrane- 

CC bound splice variant of Don-1 
XX 

SQ Sequence 1863 BP; 422 A; 602 C; 553 G; 286 T; 0 U; 0 Other; 

Query Match 80.6%; Score 723.4; DB 7; Length 1863; 

Best Local Similarity 96.2%; Pred. No. 9.2e-156; 

Matches 840; Conservative 0; Mismatches 16; Indels 17; Gaps 9; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 213 ATGAGGCGCGACCCGGCCCCC — CTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 270 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 271 TACTCGCCCAGCCTCAAGTCA — GCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 32 8 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 32 9 GGCAAGGTACAGGGGCTGGT--CAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 386 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I 
Db 387 CCCGCCTCGGGTCGGGTGGCG — GGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 444 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I 
Db 445 GGGCTGCAGCGCGAGCAGGTG — CAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 502 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M M I I I I I I I I I 
Db 503 CGCTACATCTTTTTCCTGGAG — CACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 560 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 561 CCCCT-GATACCAACGGCAAAA — CTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGGC 617 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 618 TGCGCCACCCGGCCCAAGTTGA— AAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 675 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I M I I I I I I I I I I I I I 
Db 676 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 735 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 73 6 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 795 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 796 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 855 



Qy 

Db 



661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
856 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 915 



Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 916 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 975 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I M I I I I I III 

Db 976 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 1035 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II I I I I I III I I I I I I III 

Db 1036 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 1068 



RESULT 8 




AAV17816 




ID 


AAV17816 standard; cDNA; 2268 BP. 


XX 






AC 


AAV17816; 




XX 






DT 


17-AUG-1998 (first entry) 


XX 






DE 


Homo sapiens don-1 gene splice variant. 


XX 






KW 


Murine; don-1 gene; melanoma; treatment; adenocarcinoma; epithelial cell, 


KW 


proliferation; 


stimulation; treatment; tumours; skin; oesophagus; lung; 


KW 


breast; liver; 


pancreas; colon; prostate; gastrointestinal tract; uterus, 


KW 


wound healing; 


transmembrane ; s s . 


XX 






OS 


Homo sapiens . 




XX 






FH 


Key 


Location/Qualifiers 


FT 


CDS 


69. .2012 


FT 




/*tag= a 


FT 




/note= "don-1 polypeptide" 


XX 






PN 


WO9807736-A1. 




XX 






PD 


26-FEB-1998. 




XX 






PF 


18-AUG-1997; 


97WO-US014585. 


XX 






PR 


19-AUG-1996; 


96US-00699591. 


PR 


19-NOV-1996; 


96US-00753007. 


XX 






PA 


(MILL-) MILLENNIUM BIOTHERAPEUTICS INC. 


XX 






PI 


Gearing DP^ Busfield SJ; 


XX 






DR 


WPI; 1998-169084/15. 


DR 


P-PSDB; AAW4838 


3. 


XX 






PT 


Mouse and human 


don-1 polypeptide (s) - useful for treatment of melanomas 


PT 


and adenocarcinoma (s) , and for wound healing. 


XX 






PS 


Claim 4; Fig 7; 


121pp; English, 


XX 







CC The sequence is that of a human don-1 gene splice variant. Don-1 

CC polypeptides stimulate proliferation of epithelial cells and thus are 

CC implicated in melanomas and adenocarcinomas in which epithelial cells 

CC proliferate out of control. Compounds that interfere with don-1 mediated 

CC cell proliferation can be used in the treatment of tumours such as 

CC melanomas and adenocarcinomas of the skin, oesophagus, lung, breast, 

CC liver, pancreas, gastrointestinal tract, colon, prostate or uterus. 

CC Alternatively, don-1 polypeptides can be used to stimulate epithelial 

CC cell proliferation, e.g. for wound healing 

XX 

SQ Sequence 2268 BP; 502 A; 735 C; 700 G; 331 T; 0 U; 0 Other; 

Query Match 47.7%; Score 427.8; DB 2; Length 2268; 

Best Local Similarity 89.8%; Pred. No. 4.1e-88; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0; 

Qy 363 CCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 422 

I I I I I I I III I I II I I I I I I I I II 

Db 98 CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 157 

Qy 423 CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 4 82 

I M I I I I I I I I I I II I M I I I I I I I II I II I M I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 158 AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 217 

Qy 4 83 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 542 

I I I I I I I I I I M M I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I M 
Db 218 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 277 

Qy 543 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 602 

I I I I I I I II I I I I I I I I I I I I I I I I I II II I I I I I I II I I M II I II I I I I I I I I II I I I 
Db 27 8 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 337 

Qy 603 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 662 

M M I I I I I I I M I I I I I I I M I II I I I I I I II I I I I I II I I I I I I I I I M II I I I I I I I 
Db 338 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 397 

Qy 663 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 722 

I I I II I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I i I I I M I I I I I I I I I I II 
Db 398 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 457 

Qy 723 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 782 

I I I M I I I I II I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I II I I I I I I I I I 
Db 458 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 517 

Qy 783 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 842 

I I I I I I I I I M I I I I I I I I M I I II I I I I I II II I I M I I I I I I I I I I I I I I I I I I I I I 
Db 518 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 577 

Qy 843 TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

I I I I I III I I I I I I III 
Db 578 AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 



RESULT 9 
ABS56036 

ID ABS56036 standard; cDNA; 1474 BP. 
XX 



AC ABS56036; 
XX 

DT 14-JAN~2003 (first entry) 
XX 

DE cDNA encoding human second splice variant of Don-1. 
XX 

KW Human; Don-1; epidermal growth factor; EGF; neuregulin; 

KW glycoprotein ligand; cell proliferation; cell proliferative disorder; 

KW carcinoma; adenocarcinoma cell; myeloma; cell differentiation; 

KW cell survival; epithelial cell; wound healing; tumour formation; brain; 

KW vulnerary; cytostatic; gene therapy; gene; ss. 

XX 

OS Homo sapiens . 



XX 

FH Key Location/Qualifiers 

FT CDS 68. .1473 

FT /*tag= a 

FT /partial 

FT /product= "Second splice variant of Don-1" 

FT /note= "This sequence lacks a stop codon" 

FT /transl_except= (pos:107. .108, aa : Lys ) 

FT /note= "This codon has an apparent 1 nucleotide deletion 

FT which alters the reading frame" 

XX 



PN US2002127594-A1. 
XX 

PD 12-SEP-2002, 
XX 

PF 12-MAR-2002; 2002US-00096241 . 
XX 

PR 22-JUN-2000; 2000US-00599789 , 
XX 

PA (GEAR/) GEARING D P. 

PA (BUSF/) BUSFIELD S J. 
XX 

PI Gearing DP, Busfield SJ; 
XX 

DR WPI; 2003-039584/03. 

DR P-PSDB; ABG71639. 
XX 

PT Novel Don-1 polypeptide useful for stimulating proliferation of cells, 

PT for identifying proteins that interact with Don-1, and for regulating 

PT tumor formation and progression in brain. 
XX 

PS Claim 4; Fig 4; 66pp; English. 
XX 

CC The present invention relates to the isolation of a novel gene called Don 

CC -1, and alternate splice variants of Don-1, which are related to 

CC epidermal growth factors (EGF) such as neuregulins . Don-1 polypeptides 

CC are glycoprotein ligands. Both murine and human Don-1 sequences are 

CC cloned. The mouse Don-1 gene maps to chromosome 18. Don-1 polypeptides 

CC are useful for stimulating proliferation of a cell. Antibodies to Don-1 

CC polypeptides are useful for detecting Don-1 in a sample. The Don-1 

CC polypeptides are useful for treating and diagnosing cell proliferative 

CC disorders and play a role in the proliferation of carcinomas e.g. 

CC adenocarcinoma, myeloma, in cell differentiation, proliferation and 

CC survival. The polypeptides are also useful for inhibiting proliferation 



of adenocarcinoma cells, for stimulating the proliferation of cells such 
as epithelial cells to promote wound healing, for identifying proteins 
that interact with Don-1, and for regulating tumour formation and 
progression in the brain. The polynucleotide sequences encoding Don-1 may 
be used in gene therapy. The present sequence encodes human second splice 
variant of Don-1 

Sequence 1474 BP; 335 A; 472 C; 451 G; 216 T; 0 U; 0 Others- 
Query Match 47.6%; Score 426.8; DB 7; Length 1474; 
Best Local Similarity 92.4%; Pred. No. 6.3e-88; 

Matches 449; Conservative 0; Mismatches 37; Indels 0; Gaps 0; 

Qy 38 8 AAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCCGGCCCAAGTTGAAGAAG 447 

INI I I I I II I I II I M II I I I I I II I I I I I I II M I I 

Db 121 AGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAAAGCCACCCGGCCCAAGTTGAAG/^G 180 

Qy 448 ATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCC 507 

I I I I II I I I I M I I I I I II I I I I I I II M I I I I I I I I I I I I I II II II I II I I I I I I I I I 
Db 181 ATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCC 24 0 

Qy 508 GGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGA 567 

I II I I I I I I I II I I I M I I I M I I I I I II I I I I I I I II II I I I I I I i I II I I I II I I I I I 
Db 241 GGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGA 300 

Qy 568 GACATTCGCATCAAATATGGCi^ACGGCAGAAAGAACTCACGACTACAGTTCAACi^AGGTG 627 

I M I II I I M M I I I I I I I I I M I II I I I I I II I M I I I I M II I I I I I II I I I I II I I I 

Db 301 GACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTG 360 

Qy 628 AAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACC 687 

I M I I M I I II I I II I I I I I I M I I I I I I I I I M I I I I I II I I I I I I I I I I II 

Db 361 AAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACC 420 

Qy 688 GTCCGGGGCCGGCTTTACGTC7UVCAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCAC 747 

I I I I I I II I I I I I M II I I I I I I I I I I I M II I I I II M I I I I I I I I I I I I I I I M I I I I 

Db 421 GTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCAC 480 

Qy 74 8 GCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTAC 807 

I I M I M I I M I I I I I I I I I I I I I M I I I I I I I I M I I M I I M I I I M I I I I M I M I I 

Db 4 81 GCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTAC 54 0 

Qy 808 ATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGT 867 

I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I II I III I II III 

Db 541 ATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCCAAATGGATTCTTCGGACAGAGATGT 600 

Qy 868 CAGCAG 873 

I I I 

Db 601 TTGGAG 606 



RESULT 10 
ABS56045 

ID ABS56045 standard; cDNA; 2266 BP. 
XX 

AC ABS56045; 
XX 

DT 14-JAN-2003 (first entry) 



CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 



XX 

DE cDNA encoding human third splice variant of Don-1. 
XX 

KW Human; Don-1; epidermal growth factor; EGF; neuregulin; 

KW glycoprotein ligand; ceil proliferation; cell proliferative disorder; 

KW carcinoma; adenocarcinoma cell; myeloma; cell differentiation; 

KW cell survival; epithelial cell; wound healing; tumour formation; brain; 

KW vulnerary; cytostatic; gene therapy; gene; ss. 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualif iers 

FT CDS 68. .2010 

FT /*tag= a 

FT /product= "Third splice variant of Don-1" 

FT /transl_except= {pos:107. .108, aa:Lys) 

FT /note= "This codon has an apparent 1 nucleotide deletion 

FT which alters the reading frame" 

FT /transl_except= (pos:994. .996, aa:Thr) 

XX 



PN US2002127594-A1. 
XX 

PD 12-SEP-2002. 
XX 

PF 12-MAR-2002; 2002US-00096241 . 
XX 

PR 22-JUN-2000; 2000US-00599789 . 
XX 

PA (GEAR/) GEARING D P. 

PA (BUSF/) BUSFIELD S J. 
XX 

PI Gearing DP, Busfield SJ; 
XX 

DR WPI; 2003-039584/03, 

DR P-PSDB; ABG71644. 
XX 

PT Novel Don-1 polypeptide useful for stimulating proliferation of cells, 

PT for identifying proteins that interact with Don-1, and for regulating 

PT tumor formation and progression in brain. 
XX 

PS claim 4; Fig 7; 66pp; English. 
XX 

CC The present invention relates to the isolation of a novel gene called Don 

CC -1, and alternate splice variants of Don-1, which are related to 

CC epidermal growth factors (EGF) such as neuregulins . Don-1 polypeptides 

CC are glycoprotein ligands . Both murine and human Don-1 sequences are 

CC cloned. The mouse Don-1 gene maps to chromosome 18. Don-1 polypeptides 

CC are useful for stimulating proliferation of a cell. Antibodies to Don-1 

CC polypeptides are useful for detecting Don-1 in a sample. The Don-1 

CC polypeptides are useful for treating and diagnosing cell proliferative 

CC disorders and play a role in the proliferation of carcinomas e.g. 

CC adenocarcinoma, myeloma, in cell differentiation, proliferation and 

CC survival. The polypeptides are also useful for inhibiting proliferation 

CC of adenocarcinoma cells, for stimulating the proliferation of cells such 

CC as epithelial cells to promote wound healing, for identifying proteins 

CC that interact with Don-1, and for regulating tumour formation and 

CC progression in the brain. The polynucleotide sequences encoding Don-1 may 



CC be used in gene therapy. The present sequence encodes human third splice 

CC variant of Don-1 

XX 

SQ Sequence 2266 BP; 502 A; 733 C; 700 G; 331 T; 0 U; 0 Others- 



Query Match 47.6%; Score 426,8; DB 7; Length 2266; 

Best Local Similarity 92.4%; Pred. No. 6.9e-88; 

Matches 449; Conservative 0; Mismatches 37; Indels 0; Gaps 0; 

Qy 38 8 AAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCCGGCCCAAGTTGAAGAAG 447 

MM I I I I II I I II I I II I I I II I I I I II I I I I I I I I I 

Db 121 AGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAAAGCCACCCGGCCCAAGTTGAAGAAG 18 0 

Qy 44 8 ATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCC 507 

I II I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I 
Db 181 ATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGAAGTGTGAGGCAGCAGCC 24 0 

Qy 50 8 GGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGA 567 

I M I I I I I I I I I II I I I I I I I I M I II I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I 

Db 241 GGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGGAGCTCAACCGCAGCCGA 300 

Qy 568 GACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTG 627 

II I I I I I I I I I I II I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGACTACAGTTCAACAAGGTG 360 

Qy 62 8 AAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGTU^CATCCTGGGGAAGGACACC 687 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 361 AAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACATCCTGGGGAAGGACACC 420 

Qy 688 GTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCAC 747 

I II I I I I M I I I II I I II II I I I I I I II I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I 
Db 421 GTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGTCATCCTGGTCGGGGCAC 4 80 

Qy 748 GCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTAC 8 07 

I II I I I I I I II I I I I I I I I I I I M M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 481 GCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTAC 54 0 

Qy 8 08 ATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGT 8 67 

II I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I Ml I I I I I I 

Db 541 ATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCCAAATGGATTCTTCGGACAGAGATGT 600 

Qy 8 68 CAGCAG 873 

I I I 

Db 601 TTGGAG 606 



RESULT 11 
AAV17815 

ID AAV17815 standard; cDNA; 1476 BP, 
XX 

AC AAV17815; 
XX 

DT 17-AUG-1998 (first entry) 
XX 

DE Homo sapiens don-1 gene splice variant. 
XX 

KW Murine; don-1 gene; melanoma; treatment; adenocarcinoma; epithelial cell; 



KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
XX 
PA 
XX 

PI 

XX 
DR 
DR 
XX 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

XX 
SQ 



proliferation; stimulation; treatment; tumours; skin; oesophagus; lung; 
breast; liver; pancreas; colon; prostate; gastrointestinal tract; uterus; 
wound healing; transmembrane; ss. 



Homo sapiens, 

Key 
CDS 



WO9807736-A1, 



26-FEB-1998. 



Location/Qualifiers 
69. .1475 
/*tag= a 

/note= "don-1 polypeptide" 



18-AUG-1997; 97WO-US014585 . 



19-AUG-1996; 
19-NOV-1996; 



96US-00699591, 
96US-00753007. 



(MILL-) MILLENNIUM BIOTHERAPEUTICS INC. 

Gearing DP, Busfield SJ; 

WPI; 1998-169084/15. 
P-PSDB; AAW48382. 

Mouse and human don-1 polypeptide ( s ) - useful for treatment of melanomas 
and adenocarcinoma ( s ) , and for wound healing. 

Claim 4; Fig 4; 121pp; English. 

The sequence is that of a human don-1 gene splice variant. Don-1 
polypeptides stimulate proliferation of epithelial cells and thus are 
implicated in melanomas and adenocarcinomas in which epithelial cells 
proliferate out of control. Compounds that interfere with don-1 mediated 
cell proliferation can be used in the treatment of tumours such as 
melanomas and adenocarcinomas of the skin, oesophagus, lung, breast, 
liver, pancreas, gastrointestinal tract, colon, prostate or uterus. 
Alternatively, don-1 polypeptides can be used to stimulate epithelial 
cell proliferation, e.g. for wound healing 

Sequence 1476 BP; 335 A; 475 C; 450 G; 216 T; 0 U; 0 Other; 



Query Match 47.5%; Score 426.2; DB 2; Length 1476; 

Best Local Similarity 89.6%; Pred. No. 8.6e-88; 

Matches 458; Conservative 0; Mismatches 53; Indels 0; Gaps 0; 

Qy 363 CCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 422 

I I I I I M III I I II I I I I I I I I M 

Db 98 CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 157 

Qy 423 CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 482 

I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 158 AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 217 



Qy 4 83 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 542 



Db 218 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 277 

Qy 543 TGGCi^AGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 602 

I I I M I I I I M I I I I I I M M M I I I I I I I I I I I I M I M I I I M I I M I M M I I I I I I 

Db 278 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 337 

Qy 603 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 662 

I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I 
Db 338 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 397 

Qy 663 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 722 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 398 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 457 

Qy 723 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 782 

I I I I I I I I I I I M I I I I I I I M I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I M I I 

Db 458 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 517 

Qy 7 83 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCT^CCAGCTCTCCTGCAAGTGTCC 842 

II I I I I I I M I I I I I M I I I I I I M I I I M I I I I I I I I I I 

Db 518 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 577 

Qy 843 TGTGGGATACACCGGGGACAGGTGTCAGCAG 8 73 

I I I I I II I I I I I I III 
Db 578 AT^TGGATTCTTCGCACAGAGATGTTTGGAG 608 



RESULT 12 
ABL40993 

ID ABL40993 standard; DNA; 1054 BP. 
XX 

AC ABL40993; 
XX 

DT 03-JUL-2002 (first entry) 
XX 

DE Human neuregulin 2 gene exon 1. 
XX 

KW Human; MUCl; mucin; glycoprotein; cytostatic; cancer; tumour; ECD; 

KW extracellular domain; neuregulin 2; isoform; gene; ds . 

XX 

OS Homo sapiens. 
XX 

PN WO200222685-A2. 
XX 

PD 21-MAR-2002. 
XX 

PF ll-SEP-2001; 2001WO-US028548 . 
XX 

PR ll-SEP-2000; 200 OUS-023184 IP . 
XX 

PA (KUFE/) KUFE D W. 

PA (OHNO/) OHNO T. 
XX 

PI Kufe DW, Ohno T; 
XX 

DR WPI; 2002-339864/37. 



XX 



PT Use of a mucin glycoprotein (MUCl) extracellular domain antagonist for 

PT manufacturing a medicant that inhibits the proliferation of MUC-1 

PT expressing cancer cells and that can treat cancers and reduce tumor 

PT growth. 
XX 

PS Disclosure; Page 61-62; 74pp; English. 
XX 

CC The invention relates to the use of a MUCl (mucin glycoprotein) 

CC extracellular domain (ECD) antagonist for the manufacture of a medicant 

CC to inhibit the proliferation of MUC-1 expressing cancer cells. MUCl ECD 

CC antagonists (optionally combined with a pharmaceutical carrier) can be 

CC administered to inhibit proliferation of MUCl-expressing cancer cells, 

CC useful to treat cancers e.g. skin cancer, prostate cancer and leukemia, 

CC especially in humans. The method may also be combined with administration 

CC of a chemotherapeutic agent (e.g. an alkylating agent, topisomerase etc) 

CC or radiation to treat cancer, especially to reduce tumour growth. The 

CC polypeptides are also useful in screening to identify MUCl ECD 

CC antagonists. The present sequence represents an exon fragment of the 

CC human neuregulin 2 gene 

XX 

SQ Sequence 1054 BP; 178 A; 367 C; 361 G; 148 T; 0 U; 0 Other; 



Query Match 47.3%; Score 424; DB 6; Length 1054; 

Best Local Similarity 100.0%; Pred. No. 2.5e-87; 

Matches 424; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 


60 






1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 




Db 


589 


ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 


648 


Qy 


61 


TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 


120 






1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 { 1 1 1 1 1 1 




Db 


649 


TACTCGCCCAGCCTC7iA.GTCAGTGCAGGACCAGGCGTACi\AGGCACCCGTGGTGGTGGAG 


708 


Qy 


121 


GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 


180 






1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 I 1 1 M 1 1 1 1 1 1 1 1 1 1 1 




Db 


709 


GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 


768 


Qy 


181 


CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 


240 






1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 I 1 M 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


769 


CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 


828 


Qy 


241 


GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAT^GGAACCAG 


300 






1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 M 1 1 1 1 




Db 


829 


GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 


888 


Qy 


301 


CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 


360 






1 1 1 1 1 1 M II 1 1 M M II M II 1 II 1 M II II II M 1 II II M M II M 1 II M M 1 II 1 




Db 


889 


CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 


948 


Qy 


361 


CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 


420 






M 1 II II II II 1 II M 1 1 1 1 II 1 1 1 1 II 1 1 II II M II II 1 M M 1 M II II M II II 1 1 




Db 


949 


CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 


lOOi 


Qy 


421 


TGCG 424 





1 1 1 1 



Db 



1009 TGCG 1012 



RESULT 13 
AAV17813 

ID AAV17813 standard; cDNA; 1607 BP. 
XX 

AC 7^V17813; 
XX 

DT 17-AUG-1998 (first entry) 
XX 

DE Mus musculus don-1 gene splice variant. 
XX 

KW Murine; don-1 gene; melanoma; treatment; adenocarcinoma; epithelial cell; 

KW proliferation; stimulation; treatment; tumours; skin; oesophagus; lung; 

KW breast; liver; pancreas; colon; prostate; gastrointestinal tract; uterus; 

KW wound healing; secreted protein; ss. 
XX 

OS Mus musculus. 
XX 

FH Key Location/Qualifiers 

FT CDS 79. .624 

FT /*tag= a 

FT /note= "secreted don-1 polypeptide" 
XX 

PN WO9807736-A1. 
XX 

PD 26-FEB-1998. 
XX 

PF 18-AUG-1997; 97WO-US01458 5 . 
XX 

PR 19-AUG-1996; 96US-00699591 . 

PR 19-NOV-1996; 96US-007530G7 . 
XX 

PA (MILL-) MILLENNIUM BIOTHERAPEUTICS INC. 
XX 

PI Gearing DP, Busfield SJ; 
XX 

DR WPI; 1998-169084/15. 

DR P-PSDB; AAW48380. 
XX 

PT Mouse and human don-1 polypeptide ( s ) - useful for treatment of melanomas 

PT and adenocarcinoma ( s ) , and for wound healing. 

XX 

PS Claim 4; Fig 2; 121pp; English. 
XX 

CC The sequence is that of a murine don-1 gene splice variant. Don-1 

CC polypeptides stimulate proliferation of epithelial cells and thus are 

CC implicated in melanomas and adenocarcinomas in which epithelial cells 

CC proliferate out of control. Compounds that interfere with don-1 mediated 

CC cell proliferation can be used in the treatment of tiomours such as 

CC melanomas and adenocarcinomas of the skin, oesophagus, lung, breast, 

CC liver, pancreas, gastrointestinal tract, colon, prostate or uterus. 

CC Alternatively, don-1 polypeptides can be used to stimulate epithelial 

CC cell proliferation, e.g. for wound healing 

XX 

SQ Sequence 1607 BP; 365 A; 500 C; 480 G; 262 T; 0 U; 0 Other; 



Query Match 45.2%; Score 405.4; DB 2; Length 1607; 

Best Local Similarity 87.9%; Pred. No. 5e-83; 

Matches 442; Conservative 0; Mismatches 61; Indels 0; Gaps 0 



Qy 


371 


CCAACGGCAAAT^TCTCAAGTVAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 

1 1 1 1 1 1 IN 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 


430 


Db 


2 


61 


Qy 


431 


GGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGA 

INN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 Mill 1 

GGCCCAAGCTGAAGAAGATGAAGAGCCAGACAGGAGAGGTGGGTGAGAAGCAGTCGCTCA 


490 


Db 


62 


121 


Qy 


491 


AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGG 

1 N M II II 1 1 II II M M 1 M M Ml Mill II M 1 M 1 II II II 1 M M M 

AGTGTGAGGCAGCGGCGGGAAACCCCCAGCCCTCCTATCGCTGGTTCAAGGATGGCAAGG 


550 


Db 


122 


181 


Qy 


551 


AGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGC7\ACGGCAGAAAGAACTCACGAC 

1 1 1 1 1 1 1 1 1 M II II M 1 1 M M II 1 II 1 II 1 1 1 1 1 M 1 M 1 M 1 II II 1 1 

AACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGCAATGTCAGAAAGAACTCACGGC 


610 


Db 


182 


241 


Qy 


611 


TACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACA 

N 1 1 N M II II 1 II II M 1 1 1 M M II M 1 II 1 1 1 1 II II II 1 II II II II 1 1 

TACAGTTCAACAAAGTGAGGGTGGAGGATGCCGGGGAGTACGTCTGTGAGGCCGAGAACA 


670 


Db 


242 


301 


Qy 


671 


TCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGT 

UN 1 II II 1 M M II II 1 1 1 M M II 1 II 1 II II 1 II 1 1 1 1 II M II II 1 1 

TCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTCAACAGCGTGAGCACCACTCTGT 


730 


Db 


302 


361 


Qv 

» J: 


731 


^■"•■L J- vjox v-ovjvj>aOY-i\^vjk^^_OkjLj>\M.Vj 1 \aO/\/\Uo/\ljAU/\tjUk^/\ACjl L-U i Ai l GCGTCAATG 

1 1 1 M 1 M II II 1 II M 1 II 1 1 II M M 1 M 1 1 i 1 II 1 II II 1 1 1 II II MM 

CATCCTGGTCGGGACATGCCCGGAAGTGCT^TGAGACCGCCAAGTCCTACTGTGTGAATG 


790 


Db 


362 


421 


Qy 


791 


GAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGAT 

II IN 1 M 1 M II M M M 1 M II 1 II II II M M 1 II 1 

GAGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCCAAACGGAT 


850 


Db 


422 


481 


Qy 


851 


ACACCGGGGACAGGTGTCAGCAG 873 

1 Ml 1 II II 1 III 
TCTTCGGACAGAGATGTTTGGAG 504 




Db 


482 





RESULT 14 
AAV17812 

ID AAV17812 standard; cDNA; 2467 BP. 
XX 

AC AAV17812; 
XX 

DT 17-AUG-1998 (first entry) 
XX 

DE Mus musculus don-1 gene splice variant. 
XX 

KW Murine; don-1 gene; melanoma; treatment; adenocarcinoma; epithelial cell; 

KW proliferation; stimulation; treatment; tumours; skin; oesophagus; lung; 

KW breast; liver; pancreas; colon; prostate; gastrointestinal tract; uterus; 

KW wound healing; transmembrane; ss. 
XX 



OS 


Mus musculus , 




XX 






FH 


Key 


jj'w' u J. / ^ Lid J_ X X i- o 


FT 


CDS 


7 Q 1 ft Q^; 


FT 




/ ^- ciy — d 


FT 
XX 




/'no'f"^= T* a Ti Q Tn*iTnV^ y" a Ti & (H/^r^ "1 r\(^T 1 7>^ i=i>-\ ■♦- -1 ii 

/ iiu L-t; — L-i. dlitDiiiciluJ I. dlitr \_lvJlX X po_Lypepi_lQ.e 


PN 


WO9807736-A1 , 




XX 






PD 


26-FEB-1998 . 




XX 






PF 


18-AUG-1997; 


97WO-US014585. 


XX 






PR 


19-AUG-1996; 


96US-00699591. 


PR 


19-NOV-1996; 


96US-00753007. 


XX 






PA 


(MILL-) MILLENNIUM BIOTHERAPEUTICS INC. 


XX 






PI 


Gearing DP, 


Busfield SJ; 


XX 






DR 


WPI; 1998-169084/15. 


DR 


P-PSDB; AAW48379. 


XX 






PT 


Mouse and human don-1 polypeptide { s ) - useful for treatment of melanomas 


FT 


and adenocarcinoma (s) , and for wound healing. 


XX 






PS 


Claim 4; Fig 


1; 121pp; English. 


XX 






cc 


The sequence 


is that of a murine don-1 gene splice variant. Don-1 


cc 


polypeptides 


stimulate proliferation of epithelial cells and thus are 


cc 


implicated in 


melanomas and adenocarcinomas in which epithelial cells 


cc 


proliferate out of control. Compounds that interfere with don-1 mediated 


cc 


cell proliferation can be used in the treatment of tumours such as 


cc 


melanomas and 


adenocarcinomas of the skin, oesophagus, lung, breast. 


cc 


liver, pancreas, gastrointestinal tract, colon, prostate or uterus. 


cc 


Alternatively, don-1 polypeptides can be used to stimulate epithelial 


cc 


cell proliferation, e.g. for wound healinq 


XX 






SQ 


Sequence 24 67 


BP; 592 A; 752 C; 706 G; 417 T; 0 U; 0 Other; 



Query Match 44.8%; Score 402.2; DB 2; Length 2467; 

Best Local Similarity 87.5%; Pred. No. 3e-82; 

Matches 440; Conservative 0; Mismatches 63; Indels 0; Gaps 0; 
Qy 371 CCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 430 



2 CTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 61 



Qy 431 GGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGA 4 90 

I I I I M I I I M I M I I I I I I I I I I I I I I I I III I I I I I II I II I I I II I Mill I 
Db 62 GGCCCAAGCTGAAGAAGATGAAGAGCCAGACAGGAGAGGTGGGTGAG/^GCAGTCGCTCA 121 

Qy 4 91 AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGG 550 

N I I I I I I M M I II II II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
122 AGTGTGAGGCAGCGGCGGGA/iACCCCCAGCCCTCCTATCGCTGGTTCAAGGATGGCAAGG 181 

Qy AGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGAC 610 



Db 


J- CJ z 


I 1 1 1 1 1 M 1 II M II 1 II 1 1 1 II 1 II 1 II 1 1 1 1 1 1 1 1 1 II 1 II II 1 1 1 1 1 1 
AACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGCAATGTCAGAAAGAACTCACGGC 


241 




oil 


TACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACA 
1 1 1 M 1 1 1 1 1 1 1 1 Mil 1 1 1 1 1 II 1 II 1 1 1 1 1 II 1 Mill II 1 1 1 II M 1 1 1 1 
TACAGTTCAACAAAGTGAGCGTGGAGGATGCCGGGGAGTACGTCTGTGAGGCCGAGAACA 


670 




Z 4 Z 


301 


Ov 


D / ± 


TCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGT 
1 1 I 1 N 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II 1 11 1 1 1 II 1 1 1 M 1 1 1 II 1 

TCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTCAACAGCGTGACCACCACTCTGT 


730 


L/iJ 


o uz 


361 




/ ol 


CATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATG 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II M MM 

CATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGCCAAGTCCTACTGTGTGAATG 


790 




T o 


421 


Qy 


791 


GAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGAT 

M M II 1 1 M M M M II II II M M 1 1 1 1 1 1 1 1 i 1 M 1 M t 1 1 1 ii t tt mm 

' ' ' ' ' ' ' ' ' 1 1 M M M I M M M M M M 1 M I 1 1 1 1 M 1 1 11 1 

GAGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCCAAACGGAT 


850 


Db 


422 


481 


Qy 


851 


ACACCGGGGACAGGTGTCAGCAG 873 

1 Ml 1 II II 1 III 
TCTTCGGACAGAGATGTTTGGAG 504 




Db 


482 





cDNA encoding murine secreted splice variant of Don-1. 

Murine; Don-1; epidermal growth factor; EGF; neuregulin; mouse; 
glycoprotein ligand; cell proliferation; cell proliferative disorder; 
carcinoma; adenocarcinoma cell; myeloma; cell differentiation; 
cell survival; epithelial cell; wound healing; tumour formation; brain; 
vulnerary; cytostatic; gene therapy; chromosome 18; gene; ss. 



RESULT 15 
ABS56034 

ID ABS56034 standard; cDNA; 1561 BP. 
XX 

AC ABS56034; 
XX 

DT 14-JAN-2003 (first entry) 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 
XX 

OS Mus sp, 
XX 

™ Location/Qualifiers 
FT CDS 78. .623 

FT /*tag- a 

/product= "Secreted splice variant of Don-1" 

XX 

PN US2002127594-A1. 
XX 

PD 12-SEP-2002, 
XX 

PF 12-MAR-2002; 2002US-00096241 . 
XX 

PR 22-JUN-2000; 2000US-00599789 . 
XX 

PA (GEAR/) GEARING D P. 
PA (BUSF/) BUSFIELD S J. 



PI Gearing DP, Busfield SJ; 
XX 

DR WPI; 2003-039584/03. 

DR P-PSDB; ABG71637. 
XX 

PT Novel Don-1 polypeptide useful for stimulating proliferation of cells, 

PT for identifying proteins that interact with Don-1, and for regulating 

PT tumor formation and progression in brain. 
XX 

PS Claim 4; Fig 2; 66pp; English. 
XX 

CC The present invention relates to the isolation of a novel gene called Don 

CC -1, and alternate splice variants of Don-1, which are related to 

CC epidermal growth factors (EGF) such as neuregulins . Don-1 polypeptides 

CC are glycoprotein ligands . Both murine and human Don-1 sequences are 

CC cloned. The mouse Don-1 gene maps to chromosome 18. Don-1 polypeptides 

CC are useful for stimulating proliferation of a cell. T^tibodies to Don-1 

CC polypeptides are useful for detecting Don-1 in a sample. The Don-1 

CC polypeptides are useful for treating and diagnosing cell proliferative 

CC disorders and play a role in the proliferation of carcinomas e.g. 

CC adenocarcinoma, myeloma, in cell differentiation, proliferation and 

CC survival. The polypeptides are also useful for inhibiting proliferation 

CC of adenocarcinoma cells, for stimulating the proliferation of cells such 

CC as epithelial cells to promote wound healing, for identifying proteins 

CC that interact with Don-1, and for regulating tumour formation and 

CC progression in the brain. The polynucleotide sequences encoding Don-1 may 

CC be used in gene therapy. The present sequence encodes murine secreted 

CC splice variant of Don-1 

XX 

SQ Sequence 1561 BP; 361 A; 479 C; 465 G; 256 T; 0 U; 0 Other; 

Query Match 43.9%; Score 393.4; DB 7; Length 1561; 

Best Local Similarity 87.7%; Pred. No. 2.8e-80; 

Matches 441; Conservative 0; Mismatches 61; Indels 1; Gaps 1; 



Qy 


371 


CCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 

1 M 1 1 1 M 1 1 1 1 M 1 1 M 1 1 1 M 1 1 1 M 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 I I 1 1 | 

CTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCA-CC 


430 


Db 


2 


60 


Qy 


431 


GGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGA 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 II 1 1 I 1 1 1 1 1 1 II 1 

GGCCCAAGCTGAAGAAGATGAAGAGCCAGACAGGAGAGGTGGGTGAGAAGCAGTCGCTCA 


490 


Db 


61 


120 


Qy 


491 


AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGG 

1 1 1 1 1 N 1 II II 1 II II II 1 1 1 1 M 1 1 1 1 II 1 II 1 1 1 1 1 1 M 1 1 1 II 1 1 M 1 1 

AGTGTGAGGCAGCGGCGGGAAACCCCCAGCCCTCCTATCGCTGGTTCAAGGATGGCAAGG 


550 


Db 


121 


180 


Qy 


551 


AGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGAC 
1 1 1 1 1 M II II II II 1 1 M 1 1 1 II 1 1 1 1 M 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 
AACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGCAATGTCAGAAAGAACTCACGGC 


610 


Db 


181 


240 


Qy 


611 


TACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACA 

1 1 1 M II 1 1 1 1 1 1 1 II 1 1 II II II 1 1 1 II 1 III 1 II 

TACAGTTCAACAAAGTGAGGGTGGAGGATGCCGGGGAGTACGTCTGTGAGGCCGAGAACA 


670 


Db 


241 


300 



Qy 



671 TCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGT 730 



301 TCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTCAACAGCGTGAGCACCACTCTGT 360 
731 CATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATG 790 

I I I I M I I I M I I II I I I I I I I I I I I I I I Mill I M I II II I I I II II MM 

361 CATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGCCAAGTCCTACTGTGTGAATG 42 0 
Qy 791 GAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGAT 850 

I I I M II Ml I I I I II I M I I I M II II II I I M I I M I II II I I Mill MM 

Db 421 GAGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCCAAACGGAT 480 

Qy 851 ACACCGGGGACAGGTGTCAGCAG 8 73 

I II I I II I II III 
Db 481 TCTTCGGACAGAGATGTTTGGAG 503 



Search completed: August 15, 2004, 05:47:25 
Job time : 433.237 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



August 15, 2004, 05:20:54 ; Search time 82.0629 Seconds 

(without alignments) 
6065.966 Million cell updates/sec 

US-09-864-675-3 
897 

1 atgaggcgcgacccggcccc caatggtcaacttctcctaa 897 



1365418 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 682709 seqs, 277475446 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_NA: * 

1 : /cgn2_6/ptodata/2/ina/5A_COMB . seq: + 

2 : /cgn2_6/ptodata/2/ina/5B_COMB.seq: * 

3 : /cgn2_6/ptodata/2/ina/ 6A_C0MB .seq: * 

4 : /cgn2_6/ptodata/2/ina/6B_COMB.seq: * 

5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:* 

6: /cgn2_6/ptodata/2/ina/backf ilesl .seq: * 

Fred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 

US-08-753-007A-5 

; Sequence 5, Application US/08753007A 
; Patent No. 6074841 

GENERAL INFORMATION: 

APPLICANT: Gearing, David P. 
; APPLICANT: Busfield, Samantha J. 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 
TITLE OF INVENTION: AND USES THEREFOR 
NUMBER OF SEQUENCES: 33 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Fish & Richardson P.C. 

STREET: 225 Franklin Street 
CITY: Boston 
STATE: MA 
COUNTRY: US 
ZIP: 02110-2804 
COMPUTER READABLE FORM: 



MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/753, 007A 
FILING DATE: 19-NOV-19 96 
CLASSIFICATION: 536 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/699,591 
FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 
NAME: Fasse, J. Peter 
REGISTRATION NUMBER: 32,983 
REFERENCE/DOCKET NUMBER: 07334/022001 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-542-5 070 
TELEFAX: 617-542-8 906 
TELEX: 

INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1884 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/ KEY: Coding Sequence 
LOCATION: 664. . . 1883 
OTHER INFORMATION: 
US-08-753-007A-5 

Query Match 93.1%; Score 835.4; DB 3; Length 1884; 

Best Local Similarity 98.1%; Pred. No. 2.3e-201; 

Matches 856; Conservative 0; Mismatches 16; Indels 1; Gaps 1; 
Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I M I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I 

Db 218 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 277 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 278 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACT^GGCACCCGTGGTGGTGGAG 337 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 

I I I I M I I I I I M I I M I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 338 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 397 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I 
Db 39 8 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 457 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I I M I I I I M I I I I I I M I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I 

Db 458 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 517 



Qy 



301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 



Db 518 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 577 

Qy 361 CCCCTCGATACCAACGGCAA7W^TCTC7\AG7WVGAGGTGGGCAAGATCCTGTGCACTGAC 420 

Mill I I I I I M I I I II I M I I I II M I I I I I I I I I I I II I I I I II M I M II I I M I 

Db 57 8 CCCCT-GATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGGC 636 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

I I I M I I I I I I I I I I I I I II I I I I I II I I II I I M I I I II I I I I I II II II I I I I I M I I 
Db 637 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 696 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 

I I I I I I I I I I I I I M I I I I I I I I I I II I I I I I II I II I I I I I I I I I I I I I I I I I I I I I M 
Db 697 C7\ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 756 

Qy 541 GATGGC7W\.GGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I II II I I I I I I I M 
Db 7 57 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 816 

Qy 601 AACTCACGACTACAGTTCT^CT^GGTGTVAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I M I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 817 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 876 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I 
Db 877 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 936 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

I M I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I II 

Db 937 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 996 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATC7\ACCAGCTCTCCTGC7UVGTGT 840 

M I I I I I I I I I II I I I I I I I I I II M M II I I I I II I I I I I I I I I M I I I I I I I I I III 
Db 997 TGCGTCi^ATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 1056 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II II I I I III I I I I I I III 

Db 1057 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 108 9 



RESULT 2 
US-09-398-496-5 

; Sequence 5, Application US/09398496 
; Patent No. 6133423 

GENERAL INFORMATION: 

APPLICANT: Gearing, David P. 
; APPLICANT: Bus field, Samantha J. 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 
TITLE OF INVENTION: AND USES THEREFOR 
NUMBER OF SEQUENCES: 33 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Fish & Richardson P.C. 

STREET: 225 Franklin Street 
CITY: Boston 
STATE: MA 
COUNTRY: US 
ZIP: 02110-2804 



COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette . 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2,0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/398,496 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/753,007 
FILING DATE: 19-NOV-1996 
APPLICATION NUMBER: 08/699,591 
FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 
NAME: Fasse, J. Peter 
REGISTRATION NUMBER: 32,983 
REFERENCE/DOCKET NUMBER: 07334/022001 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-542-5070 
TELEF7\X: 617-542-8 906 
TELEX : 

INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1884 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: Coding Sequence 
LOCATION: 664, . . 1883 
OTHER INFORMATION: 
US-09-398-496-5 

Query Match 93.1%; Score 835.4; DB 3; Length 1884; 

Best Local Similarity 98,1%; Pred. No. 2.3e-201; 

Matches 856; Conservative 0; Mismatches 16; Indels 1; Gaps 1; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

Db 218 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 277 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 278 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 337 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCT^CAGCACCCGAGAGCCG 18 0 

I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 338 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 397 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 240 

I I I I M I I I I I I I M I I I I I I I M I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I M I I 

Db 398 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACT^AGTGGCCGCTCCGGAGCGGG 457 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGA7\AGGAACCAG 300 

I M I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I M I I I I 



Db 



45 8 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 517 



Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

M I I I I I I I I M I I I I I I I I I I I I I I I I M M I I I I I I I I I M I I I I I I I I I I I I I I I I I 

Db 518 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 577 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

Mill I I I II I M II I I I I I I I I I I I II I I I I I I I I I I II I II I I II I I I I I I I I I I I 
Db 578 CCCCT-GATACCAACGGC7VAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGGC 636 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 48 0 

I I I II I I I I M I II II II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I 
Db 637 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 69 6 

Qy 4 81 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 

I I I M I II I I I M I I I I II I I II I I I II I I M I I M I II I I I I II I I I I II I I I I II I I I 
Db 697 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 756 

Qy 541 GATGGCAAGGAGCTCT^CCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

M I I I II I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I II I I I I I M I II I I I M I M I 
Db 757 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 816 

Qy 601 AACTCACGACTACAGTTCAACAAGGTG7VAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I I I I I M I I M I II I I I II I I I I I I II I I I I I M I I I I I M I I II I I II I I I I I M I 
Db 817 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 876 

Qy 661 GCCGAG7\ACATCCTGGGG7^GGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

M I I I M I I I II I I I I I I I I I I M II I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 877 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 936 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

I I I I I I 11 I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I II II I II I I I I 
Db 937 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 996 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 84 0 

I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II II II II M II II II M II I Ml 

Db 997 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 1056 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

M I I II I III I II II I III 

Db 1057 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 108 9 



RESULT 3 

US-08-525-864A-1 

; Sequence 1, Application US/08525864A 
; Patent No. 5912326 

GENERAL INFORMATION: 

APPLICANT: Chang, Han 

TITLE OF INVENTION: Cerebellum-derived Growth Factors, and Uses 
TITLE OF INVENTION: Related thereto 
NUMBER OF SEQUENCES: 18 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: LTUilVE & COCKFIELD 

STREET: 28 State Street 

CITY: Boston 

STATE: Massachusetts 



; COUNTRY: USA 

; ZIP: 02109 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: AscII (text) 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8 / 52 5 , 8 64A 
FILING DATE: 8-SEP-1995 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
NAME: Kara, Catherine J. 
REGISTRATION NUMBER: 41,106 
REFERENCE/DOCKET NUMBER: HUI-017 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617)227-74 00 
TELEFAX: (617)7 42-4214 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 3441 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/ KEY: CDS 
; LOCATION: 180.. 2 441 

US-08-525-864A-1 



Query Match 87.4%; Score 784; DB 2; Length 3441; 

Best Local Similarity 92.2%; Pred. No. 2.5e-188; 

Matches 826; Conservative 0; Mismatches 70; Indels 0; Gaps 0; 



Qy 


1 


ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 

N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

ATGAGGCGCGACCCGGCCCCCGGCTTCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 


60 


Db 


180 


239 


Qy 


61 


TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 

1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 M M 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 I 1 1 1 1 1 1 1 1 M M 1 1 M 1 

TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 


120 


Db 


240 


299 


Qy 


121 


GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 

1 1 N 1 1 1 1 1 1 I 1 1 1 1 1 1 1 III II 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

GGCAAGGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCT 


180 


Db 


300 


359 


Qy 


181 


CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 

II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M II M 1 II 1 1 M 1 1 1 1 1 1 1 M 1 1 1 

CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 


240 


Db 


360 


419 


Qy 


241 


GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 

< 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 II 

GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCGAAAGGAACCAG 


300 


Db 


420 


479 


Qy 


301 


CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 Ill 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 

CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 


360 


Db 


480 


539 



Qy 


361 


CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 
II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 


420 


Db 


540 


599 


Qy 


421 


TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 
1 1 1 1 1 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 II 1 III III II II 1 1 1 1 1 1 
TGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGAAG 


480 


Db 


600 


659 


Qy 


481 


CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 
II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II II 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 II 
CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 


540 


Db 


660 


719 


Qy 


541 


GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 
II M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 II 1 II 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 M 1 1 II II 
GACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGCATCAAGTATGGCAACGGCAGAAAG 


600 


Db 


720 


779 


Qy 


601 


AACTCACGACTACAGTTCT^CT^GGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 
M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 Mill 1 1 1 1 1 Ml 
TWVCTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 


660 


Db 


780 


839 


Qy 


661 


GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 

M 1 M 1 II II II 1 M M 1 1 1 M M II II II II 1 1 M 1 II II 1 1 1 1 MUM 

GCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 


720 


Db 


840 


899 


Qy 


721 


ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 

1 1 1 1 1 Mill 1 1 1 1 1 M 1 II II M 1 M 1 M II 1 1 II M 1 II II 1 II II II II II M 

ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 


780 


Db 


900 


959 


Qy 
Db 


781 
960 


TGCGTCAATGGAGGCGTCTGrTArTAPATrf^Af^f^r^rATr AArrarrTPTr'nTr^r anr-fpr-rr 

M II 1 II 1 1 II 1 II 1 1 II 1 M II 1 II 1 M M 1 1 1 1 1 II 1 1 1 II II 1 II II II M 1 

TGTGTG7\ATGGAGGCGTGTGCTACTACATCGAAGGCATCAACCAACTCTCCTGCAAGTGT 


1019 


Qy 


841 


CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCTA 8 96 
M M II 1 II II 1 II 1 M II M II 1 II 1 II 1 II II 1 1 M II II II II M 1 1 M II 1 

CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCAA 1075 




Db 


1020 





RESULT 4 

US-08-525-864A-3 

; Sequence 3, Application US/08525864A 
; Patent No. 5912326 
; GENERAL INFORMATION: 

APPLICANT: Chang, Han 

TITLE OF INVENTION: Cerebellum-derived Growth Factors, and Uses 
; TITLE OF INVENTION: Related thereto 
; NUMBER OF SEQUENCES: 18 

CORRESPONDENCE ADDRESS: 
; 7VDDRESSEE: LAHIVE & COCKFIELD 

STREET: 28 State Street 
CITY: Boston 
STATE: Massachusetts 
COUNTRY: USA 
ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 



OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: AscII (text) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER; US/08/525 , 864A 
FILING DATE: 8-SEP~1995 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
NAME: Kara, Catherine J. 
REGISTRATION NUMBER: 41,106 
REFERENCE/DOCKET NUMBER: HUI-017 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: ( 617 ) 227-7400 
TELEFAX: ( 617 ) 742-4214 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 993 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: CDS 
LOCATION: 1..99 0 
US-08-525-864A-3 

Query Match 82.3%; Score 738.6; DB 2; Length 993; 

Best Local Similarity 90.4%; Pred. No. 4.7e-177; 

Matches 789; Conservative 0; Mismatches 84; Indels 0; Gaps 0; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCGATGCTGCTCTTCGGTGTGTCACTCGCCTGC 60 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I M I I I I I I I 
Db 61 TACTCGCCCAGCCTCAAGTCCGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

I I I I I I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I II I II I I I I I 
Db 121 GGCAAGGTACAGGGACTGGCCCCGGCAGGCGGTTCCAGCTCTAACAGCACCCGAGAGCCT 18 0 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I I I I I I I I I I I II I I II I Mil I I I I I I II I I I I I I I I I I I I I I I I I M I I M I I 
Db 181 CCCGCCTCGGGTCGGGTGGCGCTGGTGAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I I I I I I I I I I I I I II I I II I I I I I I II I I I I I I I I I I I II I I I II I I I I I M I I I 
Db 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGCGCGCCGCTCGAAAGGAACCAG 300 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I I I M I I I M I I I I I II I I I I II I II I I I I I I I I I II I I I I I I I I I I I I I I 

Db 301 CGCTACATCTTTTTCCTGGAGCCCACCGAGCAGCCCTTAGTTTTTAAGACAGCCTTTGCC 360 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

II I I I I I I I I I M M I I I I I I I I I I I M I M M I I I I I I I I I I I I I I I I I I I I 

Db 361 CCGGTCGACCCTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 



Qy 


421 


TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 
Mill M II 1 1 1 1 1 1 1 1 1 II II M II 1 1 1 II II 1 II II III M M 1 1 1 1 1 1 1 II 
TGCGCAACCCGGCCCAAGCTGAAGAAGATGAAGAGTCAGACAGGAGAGGTGGGCGAGAAG 


480 


Db 


421 


480 


Qy 


481 


C7\ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 

M Mill 1 1 M M II 1 M II II II M II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

CAGTCGCTCAAGTGTGAGGCGGCGGCGGGGAACCCCCAGCCCTCCTATCGATGGTTCAAG 


540 


Db 


481 


540 


Qy 


541 


GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 
M M 1 1 1 II II 1 II II II 1 II II 1 II 1 II 1 II 1 1 1 1 1 1 1 II 1 1 II II 1 1 M 1 1 M 
GACGGCAAGGAGCTCAACCGGAGTCGTGACATTCGCATCAAGTATGGCAACGGCAGAAAG 


600 


Db 


541 


600 


Qy 
Db 


601 
601 


AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 
M II II II 1 1 1 1 1 1 II 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 II III 
AACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTGTGAG 


660 
660 


Qy 


661 


GCCGAG7WVCATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 
M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M M II II 1 1 1 1 1 II II 1 1 II 1 1 II 
GCTGAGTyvCATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGTGAGC 


720 


Db 


661 


720 


Qy 


721 


ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGT^GTGCAACGAGACAGCCAAGTCCTAT 

Mill INN 1 II M II II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II M II 1 II 1 1 1 

ACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTCCTAC 


780 


Db 


721 


780 


Qy 


781 


TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 
II II 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 III 

II II II II II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M II II 1 1 M 1 1 M 1 II II M 1 1 M 1 III 

TGTGTGAATGGAGGCGTGTGCTACTACATCGAAGGCATCAACCAACTCTCCTGCAAATGT 


840 


Db 


781 


840 


Qy 


841 


CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 
II II II 1 III 1 II 1 1 1 III 
CCAAACGGATTCTTCGGACAGAGATGTTTGGAG 873 




Db 


841 





RESULT 5 

US-08-753-007A-7 

; Sequence 7, Application US/08753007A 

; Patent No. 6074841 

; GENERAL INFORMATION: 

APPLICANT: Gearing, David P. 
APPLICANT: Busfield, Samantha J. 
; TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 

; TITLE OF INVENTION: AND USES THEREFOR 

NUMBER OF SEQUENCES: 33 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 

STREET: 225 Franklin Street 

CITY: Boston 

STATE: MA 

COUNTRY: US 

ZIP: 02110-2804 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 2,0 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/753 , 007A 



I 



FILING DATE: 19-NOV-1996 

CLASSIFICATION: 536 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/699,591 

FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,983 

REFERENCE/ DOCKET NUMBER: 07334/022001 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617-542-5070 

TELEFAX: 617-542-8906 
; TELEX: 

INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1476 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: Coding Sequence 

LOCATION: 69... 1475 

OTHER INFORMATION: 
US-08-753-007A-7 

Query Match 47.7%; Score 427.8; DB 3; Length 1476; 

Best Local Similarity 89.8%; Pred. No. 1.3e-98; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0; 

Qy 363 CCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 422 

M I I I I I III I I I I I I I I I I I I II 

Db 98 CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 157 

Qy 42 3 CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 4 82 

I I I I M I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I II I I II II I I I M 
Db 158 AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 217 

Qy 4 83 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 542 

I I I I I I I I II I I I I I I M I M M I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I 
Db 218 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 277 

Qy 543 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 602 

I I I I II I II I I I I I I I I I I I II I I I I I I I I I I I I I I M I M M I I I I I I I I I I I II I M I 
Db 278 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 337 

Qy 603 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 662 

I I I M I I I I II I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I II I I I II I I I I I I I I 
Db 338 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 397 

Qy 663 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 722 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 398 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 457 

Qy 723 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 782 

I I M I I I I I M I I M I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I M M I I I I I M 

Db 458 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 517 



Qy 783 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 842 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I M I I I I I I I I I I I Mill 
Db 518 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 577 

Qy 843 TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

I I II I III I I I I I I III 
Db 57 8 AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 



RESULT 6 
US-09-398-496-7 

; Sequence 7, Application US/09398496 
; Patent No. 6133423 
; GENERAL INFORMATION: 

APPLIC7UJT: Gearing, David P. 

APPLICANT: Busfield, Samantha J, 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 
TITLE OF INVENTION: AND USES THEREFOR 
; NUMBER OF SEQUENCES: 33 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 
STREET: 225 Franklin Street 
CITY: Boston 
STATE: MA 
COUNTRY: US 
ZIP: 02110-2804 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
; OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/398,496 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/753,007 
; FILING DATE: 19-NOV-1996 

APPLICATION NUMBER: 08/699,591 
FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,983 
; REFERENCE/DOCKET NUMBER: 07334/022001 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-542-5070 
; TELEFAX: 617-542-8906 

; TELEX: 

; INFORMATION FOR SEQ ID NO: 7: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 1476 base pairs 

' TYPE: nucleic acid 

; STRANDEDNESS: single 

; TOPOLOGY: linear 

; MOLECULE TYPE: cDNA 
' FEATURE: 



; NAME/KEY: Coding Sequence 

; LOCATION: 69... 1475 

OTHER INFORMATION: 
US-09-398-496-7 

Query Match 47.7%; Score 427.8; DB 3; Length 1476; 

Best Local Similarity 89.8%; Pred. No. l,3e-98; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0 

Qy 363 CCTCGATACCT^CGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 422 

M I I I I I III I II I II I I I I I I II 

Db 98 CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 157 

Qy 423 CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 482 

I M I II II I I II M I I I I I I I I I I II I II I I II I I I I I II I I I I I I II I I M I I I I I M 
Db 158 AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 217 

Qy 483 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 542 

I I M I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I II I I I I II I I M I I I I I M 

Db 218 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 277 

Qy 543 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGT^A 602 

I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I II I I 
Db 27 8 TGGCi^AGGAGCTCAACCGCAGCCGAGACATTCGCATCTWVTATGGCAACGGCAGAAAGTiA 337 

Qy 603 CTCACGACTACAGTTCAACT^GGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 662 

M M I I I I M M I I I I I I I I I I I M I II M I I I I I I I I I I I I I I I M I I I I I I I II I I M 

Db 338 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 397 

Qy 663 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 722 

I M I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 398 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 457 

Qy 723 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 782 

I I I I I I M I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I II I II I I I I I I I 
Db 458 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 517 

Qy 783 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 842 

I M I I I M I II I I I I I I I I I I I I I I I M I M I I I I I I I I II I I I I I II I I I I II Mill 

Db 518 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 577 

Qy 843 TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

I I I I I III I I I I I I III 
Db 578 AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 



RESULT 7 

US-08-753-007A-31 

; Sequence 31, Application US/08753007A 

; Patent No. 6074841 

; GENERAL INFORMATION: 

; APPLICANT: Gearing, David P. 

APPLICANT: Busfield, Samantha J. 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 
; TITLE OF INVENTION: AND USES THEREFOR 

NUMBER OF SEQUENCES: 33 
CORRESPONDENCE ADDRESS: 



; ADDRESSEE: Fish & Richardson P.C. 

STREET: 225 Franklin Street 

CITY: Boston 

STATE: MA 

COUNTRY: US 

ZIP: 02110-2804 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 2,0 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/ 0 8/753 , 0 OVA 

FILING DATE: 19-NOV-1996 

CLASSIFICATION: 536 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/699,591 
; FILING DATE: 19-AUG-1996 

ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

; REGISTRATION NUMBER: 32,983 

REFERENCE/DOCKET NUMBER: 07334/022001 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617-542-5070 
TELEFAX: 617-542-8906 
TELEX : 

INFORMATION FOR SEQ ID NO: 31: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 2268 base pairs 
; TYPE: nucleic acid 

; STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: Coding Sequence 
LOCATION: 69... 2009 
OTHER INFORMATION: 
US-08-753-007A-31 

Query Match 47.7%; Score 427.8; DB 3; Length 2268; 

Best Local Similarity 89.8%; Pred. No. 1.5e-98; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0; 

Qy 363 CCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 422 

I I I I I I I III I I II I I I I I I I I II 

Db 98 CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 157 

Qy 423 CGCCACCCGGCCCAAGTTGT^GAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 4 82 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I M I I I I I I I 

158 AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 217 

Qy 4 83 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTC7WVGGA 542 

I I I II I I I M I I I I I I I I I I I II II I I I I I II I I I I I I I I I M I I I I M I I I II M I I I I 
Db 218 ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 277 

Qy 543 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 602 
IN M I I I I I I I I I II M I M I I II II I I I I I I I I I I I I I 



Db 



278 TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 337 



Qy 603 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 662 

M I I I I M M I I I M I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 33 8 CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 397 

Qy 663 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 722 

I I I M I I I I I I I I M M I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I M 

Db 398 CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 457 

Qy 723 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 7 82 

I I I M M I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

Db 458 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCT^CGAGACAGCCAAGTCCTATTG 517 

Qy 7 83 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 8 42 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I M I I I I I I I I 
Db 518 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 577 

Qy 843 TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

I I I I I III I I I I I I III 
Db 578 AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 



RESULT 8 

US-09-398-496-31 

; Sequence 31^ Application US/09398496 
; Patent No, 6133423 

GENERAL INFORMATION: 

APPLICANT: Gearing, David P. 
; APPLICANT: Bus field, Samantha J. 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 
; TITLE OF INVENTION: AND USES THEREFOR 

NUMBER OF SEQUENCES: 33 
CORRESPONDENCE ADDRESS: 

7UDDRESSEE: Fish & Richardson P.C. 
STREET: 225 Franklin Street 
CITY: Boston 
STATE: MA 
; COUNTRY: US 

ZIP: 02110-2804 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette 

; COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2.0 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/398,496 
FILING DATE: 
CLASSIFICATION: 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: 08/753,007 

FILING DATE: 19-NOV-1996 
; APPLICATION NUMBER: 08/699,591 

FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 
NAME: Fasse, J, Peter 
; REGISTRATION NUMBER: 32,983 



REFERENCE/DOCKET NUMBER: 07334/022001 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 617-542-5070 

TELEFAX: 617-542-89 06 
; TELEX: 

INFORMATION FOR SEQ ID NO: 31: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 2268 base pairs 

TYPE: nucleic acid 
; STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: Coding Sequence 
LOCATION: 69... 2009 
OTHER INFORMATION: 
US-09-398-496-31 



Query Match 47.7%; Score 427.8; DB 3; Length 2268; 

Best Local Similarity 89.8%; Pred. No. 1.5e-98; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0; 



Qy 


363 


CCTCGATACCJ^ACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 


422 


Db 


98 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 II 

CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 


157 


Qy 

Db 


423 
158 


CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 

1 N 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 M 1 1 1 M 

AGCCACCCGGCCCAAGTTGiyVGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 


482 
217 


Qy 


483 


ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 

1 1 1 M 1 1 M M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 II M 

ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCT^GGA 


542 


Db 


218 


277 


Qy 


543 


TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 

1 1 1 1 1 1 1 1 M 1 1 II M 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 II 1 1 II 1 1 1 1 1 1 II 1 II II M 1 II 1 1 1 

TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAT^TATGGCAACGGCAGAAAGAA 


602 


Db 


278 


337 


Qy 


603 


CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M II 

CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 


662 


Db 


338 


397 


Qy 

Db 


663 
398 


CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 
1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 
CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 


722 
457 


Qy 


723 


CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGC7\ACGAGACAGCCAAGTCCTATTG 

1 M 1 1 1 M M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 

CACCCTGTCATCCTGGTCGGGGCACGCCCGGT^AGTGCAACGAGACAGCCAAGTCCTATTG 


782 


Db 


458 


517 


Qy 


783 


CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 

UN 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 


842 


Db 


518 


577 


Qy 


843 


TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

1 II 1 1 III 1 1 1 1 1 1 III 
AAATGGATTCTTCGGACAGAGATGTTTGGAG 60 8 




Db 


578 





RESULT 9 

US-08-753-007A-3 

; Sequence 3, Application US/08753007A 

; Patent No. 6074841 

; GENER7VL INFORMATION: 

APPLICANT: Gearing, David P. 
APPLICANT: Bus field, Samantha J. 
; TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 

TITLE OF INVENTION: T^ND USES THEREFOR 
NUMBER OF SEQUENCES: 33 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 
; STREET: 225 Franklin Street 

CITY: Boston 
; STATE: MA 

COUNTRY: US 
ZIP: 02110-2804 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
SOFTWT^E: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/753, 007A 
FILING DATE: 19-NOV-1996 
CLASSIFICATION: 536 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/699,591 
FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,983 
REFERENCE/DOCKET NUMBER: 07334/022001 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-542-5070 
; TELEFAX: 617-542-8906 

TELEX: 

INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 1607 base pairs 

; TYPE: nucleic acid 

; STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: Coding Sequence 
LOCATION: 79. . . 621 
OTHER INFORMATION: 
US-08-753-007A-3 

Query Match 45.2%; Score 405.4; DB 3; Length 1607; 

Best Local Similarity 87.9%; Pred. No. 5.8e-93; 

Matches 442; Conservative 0; Mismatches 61; Indels 0; Gaps 0; 



2y 371 CCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 430 



Db 


2 


1 1 1 M 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

CTAACGGCAAAAACATCAAGAAAGAGGTGGGCT^GATCCTGTGCACTGACTGCGCCACCC 


61 


Qy 


431 


GGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGA 

1 1 1 1 N 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 III II 1 II M II II III M 1 

GGCCCAAGCTGAAGAAGATGAAGAGCCAGACAGGAGAGGTGGGTGAGAAGCAGTCGCTCA 


490 


Db 


62 


121 


Qy 


491 


AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGG 

UN II 1 1 II II II 1 II II II 1 Mill II 1 1 1 M 1 1 1 1 1 1 1 1 1 

AGTGTGAGGCAGCGGCGGGAAACCCCCAGCCCTCCTATCGCTGGTTCAAGGATGGCAAGG 


550 


Db 


122 


181 


Qy 


551 


AGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGAC 

1 1 1 M M 1 1 M II II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

AACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGCAATGTCAGAAAGAACTCACGGC 


610 


Db 


182 


241 


Qy 


611 


TACAGTTCAACAAGGTGT^GGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACA 

1 1 1 1 1 1 1 1 M II 1 MM 1 II II II II II M 1 M II 1 II 1 II 1 M II II II 1 M 1 

TACAGTTCAAC7WVGTGAGGGTGGAGGATGCCGGGGAGTACGTCTGTGAGGCCGAGAACA 


670 


Db 


242 


301 


Qy 


671 


TCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGT 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 II 1 II 1 1 M II II M II M II II MM 

TCCTTGGG7VAGGACACCGTGAGGGGCCGACTCCATGTCAACAGCGTGAGCACCACTCTGT 


730 


Db 


302 


361 


Qy 


731 


CATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATG 

1 M 1 1 1 1 1 I 1 1 1 II II M II II 1 1 Mill II 1 M M M II II II MM 

CATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGCCAAGTCCTACTGTGTGAATG 


790 


Db 


362 


421 


Qy 


791 


GAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGAT 

1 1 1 1 1 t 1 1 1 1 1 1 1 t 1 1 1 1 r 1 1 1 1 1 1 t 1 1 1 1 1 . . J . ... 

1 1 1 N M 1 M M M II M 1 II 1 II M II II 1 M II 1 1 II M II II MM! MM 

GAGGCGTGTGCTACTACATCGAGGGCATCT^CCAGGTCTCCTGCAAATGTCCAAACGGAT 


850 


Db 


422 


481 


Qy 


851 


ACACCGGGGACAGGTGTCAGCAG 873 

1 III 1 1 1 II 1 III 
TCTTCGGACAGAGATGTTTGGAG 504 




Db 


482 





RESULT 10 
US-09-398-496-3 

; Sequence 3, Application US/09398496 
; Patent No. 6133423 
; GENERAL INFORMATION: 

APPLICANT: Gearing, David P. 

APPLICANT: Bus field, Samantha J, 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 
; TITLE OF INVENTION: AND USES THEREFOR 

; NUMBER OF SEQUENCES: 33 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 

STREET: 225 Franklin Street 

CITY: Boston 

STATE: MA 

COUNTRY: US 

ZIP: 02110-2804 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 



SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/398 , 496 
; FILING DATE: 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 08/753,007 

FILING DATE: 19-NOV-1996 
; APPLICATION NUMBER: 08/699,591 

; FILING DATE: 19-AUG-1996 

; ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,983 

REFERENCE/DOCKET NUMBER: 07334/022001 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617-542-5070 

TELEFAX: 617-542-8906 

TELEX: 

; INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1607 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: Coding Sequence 
LOCATION: 79. , , 621 
OTHER INFORMATION: 
US-09-398-496-3 



Query Match 45.2%; Score 405.4; DB 3; Length 1607; 

Best Local Similarity 87.9%; Pred. No. 5.8e-93; 

Matches 442; Conservative 0; Mismatches 61; Indels 0; Gaps 0; 



Qy 


371 


CCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 

1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 I 1 1 M 1 I 1 1 1 1 1 1 1 1 1 1 1 M 

CTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 


430 


Db 


2 


61 


Qy 


431 


GGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGA 

M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 M 1 1 III M 1 1 M 1 1 1 1 II M II 1 

GGCCC7\AGCTGAAGAAGATGAAGAGCCAGACAGGAGAGGTGGGTGAGAAGCAGTCGCTCA 


490 


Db 


62 


121 


Qy 


491 


AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGG 

1 N 1 M 1 1 1 1 1 M II II II 1 M 1 1 1 II 1 1 1 1 1 M M 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 

AGTGTGAGGCAGCGGCGGGAAACCCCCAGCCCTCCTATCGCTGGTTCAAGGATGGCAAGG 


550 


Db 


122 


181 


Qy 


551 


AGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGAC 
1 1 1 1 1 1 1 1 1 II II II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 
AACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGCAATGTCAGAAAGAACTCACGGC 


610 


Db 


182 


241 


Qy 


611 


TACAGTTCAACAAGGTG7\AGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACA 

M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 

TACAGTTCAACAAAGTGAGGGTGGAGGATGCCGGGGAGTACGTCTGTGAGGCCGAGAACA 


670 


Db 


242 


301 


Qy 


671 


TCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGT 


730 



I N I I I Ml I I I I I I I M I I I I I I I I I I I I 



Dd 


302 


TCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTCAACAGCGTGAGCACCACTCTGT 


361 


Qy 


731 


CATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATG 

N 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II II II 1 1 

CATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGCCAAGTCCTACTGTGTGAATG 


790 


Db 


362 


421 


Qy 


791 


GAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGAT 

11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 t 1 1 I I 

'''>'■' M 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 I 1 1 1 1 1 1 M INI 

GAGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCCAAACGGAT 


850 


Db 


422 


481 


Qy 


851 


ACACCGGGGACAGGTGTCAGCAG 873 




Db 


482 


1 III 1 1 1 1 II III 
TCTTCGGACAGAGATGTTTGGAG 504 





RESULT 11 
US-08-753-007A-1 

; Sequence 1, Application US/08753007A 

; Patent No. 6074841 

; GENERAL INFORMATION: 

; APPLICANT: Gearing, David P. 

APPLICANT: Bus field, Samantha J. 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 
TITLE OF INVENTION: AND USES THEREFOR 
NUMBER OF SEQUENCES: 33 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 
; STREET: 225 Franklin Street 

CITY: Boston 

STATE : MA 

COUNTRY: US 

ZIP: 02110-2804 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/753 , 007A 
; FILING DATE: 19-NOV-1996 

CLASSIFICATION: 536 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/699,591 
; FILING DATE: 19-AUG-1996 

; ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

; REGISTRATION NUMBER: 32,983 

r REFERENCE/DOCKET NUMBER: 07334/022001 

r TELECOMMUNICATION INFORMATION: 

; TELEPHONE: 617-542-5070 

f TELEFAX: 617-542-8906 

; TELEX: 

; INFORMATION FOR SEQ ID NO: 1: 
' SEQUENCE CHARACTERISTICS: 

LENGTH: 2467 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single 



TOPOLOGY: circular 
MOLECULE TYPE: cDNA 
; FEATURE : 

; N7My[E/KEY: Coding Sequence 

LOCATION: 79... 1893 
OTHER INFORMATION: 

US-08-753-007A-1 



Query Match 45.2%; Score 405.4; DB 3; Length 2467; 

Best Local Similarity 87.9%; Pred. No. 6.7e-93; 

Matches 442; Conservative 0; Misinatches 61; Indels 0; Gaps 0; 

Qy 371 CCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 430 

I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CTAACGGCAAAAACATCAAGAAAGAGGTGGGCT^AGATCCTGTGCACTGACTGCGCCACCC 61 



Db 2 



431 GGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGA 490 
I I I I N I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
^2 GGCCCAAGCTGAAGAAGATGAAGAGCCAGACAGGAGAGGTGGGTGAGAAGCAGTCGCTCA 121 

Qy 4 91 AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGG 550 

I I I I I I I I I I I I I II II II I II II I I M II M I M I M 

AGTGTGAGGCAGCGGCGGGAAACCCCCAGCCCTCCTATCGCTGGTTCAAGGATGGCAAGG 181 



Db 122 



Qy 551 AGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGAC 610 

I I I I M I M M II II I I I II I I I I II I I I II I I I I I II I I I I I II I I I I I I 

AACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGCAATGTCAGAAAGAACTCACGGC 241 



Db 182 



Qy 611 TACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACA 670 

I I I I II I I II I I I I I I I M I I I II I I II I I I II I I I I I I II I I I I M I I I I M I 
TACAGTTCAACAAAGTGAGGGTGGAGGATGCCGGGGAGTACGTCTGTGAGGCCGAGAACA 



Db 242 



301 



Qy 671 TCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGT 730 

_ I'll INN II II III II I II II I I II Mill IMIIIIMM MM 

Db 302 TCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTCAACAGCGTGAGCACCACTCTGT 361 

Qy 731 CATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATG 790 

MM MUM II I M III II II III I III Mill III I I III I II II M MM 

Db 362 CATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGCCAAGTCCTACTGTGTGAATG 421 

Qy 7 91 GAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGAT 850 

M I I I I M M I I II II I M M I I I M I I I I II II II I I II I 

GAGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCCAAACGGAT 4 81 



Db 422 



Qy 851 ACACCGGGGACAGGTGTCAGCAG 873 

I Ml I II II I III 
Db 4 82 TCTTCGGACAGAGATGTTTGGAG 504 



RESULT 12 
US-09-398-496-1 

; Sequence 1, Application US/09398496 
; Patent No. 6133423 

GENERAL INFORMATION: 
; APPLICANT: Gearing, David P. 

APPLICANT: Busfield, Samantha J. 
; TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 



TITLE OF INVENTION: AND USES THEREFOR 
NUMBER OF SEQUENCES: 33 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.O. 
STREET: 225 Franklin Street 
CITY: Boston 
; STATE: MA 

COUNTRY: US 
ZIP: 02110-2804 
; COMPUTER READT^LE FORM: 

MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible 

; OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/39 8 , 4 96 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/753,007 
FILING DATE: 19-NOV-1996 
APPLICATION NUMBER: 08/699,591 
FILING DATE: 19-AUG-19 96 
ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,983 
; REFERENCE/DOCKET NUMBER: 07334/022001 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-542-5070 
; TELEF7\X: 617-542-8906 

TELEX : 

INFORMATION FOR SEQ ID NO: 1: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 2467 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: circular 
MOLECULE TYPE: cDNA 
; FEATURE : 

NAME/KEY: Coding Sequence 
LOCATION: 79. .. 18 93 
OTHER INFORMATION: 
US-09-398-496-1 

Query Match 45.2%; Score 405.4; DB 3; Length 2467; 

Best Local Similarity 87.9%; Pred. No, 6.7e-93; 

Matches 442; Conservative 0; Mismatches 61; Indels 0; Gaps 0 
Qy 371 CCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 430 



Db 



2 CTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 61 




Qy 



431 GGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGA 4 90 





Db 



62 GGCCCAAGCTGAAGAAGATGAAGAGCCAGACAGGAGAGGTGGGTGAGAAGCAGTCGCTCA 121 



Qy 491 AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGG 550 



Db 


122 


1 1 1 M 1 1 1 1 1 1 1 1 II II II 1 1 1 II 1 1 1 1 1 1 II M II 1 1 1 II II 1 1 1 1 1 1 1 1 II 
AGTGTGAGGCAGCGGCGGGAAACCCCCAGCCCTCCTATCGCTGGTTCAAGGATGGCAAGG 


181 


Qy 


551 


AGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGAC 
1 M 1 II II 1 II II II 1 1 II 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGCAATGTCAGAAAGAACTCACGGC 


610 


Db 


182 


241 


Qy 


611 


TACAGTTCAAC7UVGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACA 
N 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 
TACAGTTCAACAAAGTGAGGGTGGAGGATGCCGGGGAGTACGTCTGTGAGGCCGAGAACA 


670 


Db 


242 


301 


Qy 


671 


TCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGT 
1 1 1 1 1 1 II 1 1 II 1 M 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II MM 
TCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTCAACAGCGTGAGCACCACTCTGT 


730 


Db 


302 


361 


Qy 


731 


CATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATG 

1 1 N 1 II 1 II 1 1 1 II 1 1 1 1 II II II M II Mill 11 1 1 1 1 M M 1 M II 1 1 1 1 

CATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGCCAAGTCCTACTGTGTGAATG 


790 


Db 


362 


421 


Qy 


791 


GAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGAT 

1 1 1 1 M 1 M M 1 M 1 1 1 1 1 1 II II II 1 II II II 1 II 1 1 II II 1 1 1 1 II 1 1 MM 

GAGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGC7WVTGTCCAAACGGAT 


850 


Db 


422 


481 


Qy 


851 


ACACCGGGGACAGGTGTCAGCAG 873 

1 III 1 II 1 1 1 III 
TCTTCGGACAGAGATGTTTGGAG 504 




Db 


482 





RESULT 13 
US-08-525-864A-5 

; Sequence 5, Application US/08525864A 

; Patent No. 5912326 

; GENERAL INFORMATION: 

; APPLIC7\NT: Chang, Han 

TITLE OF INVENTION: Cerebellum-derived Growth Factors, and Use 
; TITLE OF INVENTION: Related thereto 
NUMBER OF SEQUENCES: 18 
CORRESPONDENCE ADDRESS: 

7VDDRESSEE: LAHIVE & COCKFIELD 

STREET: 28 State Street 

CITY: Boston 

STATE: Massachusetts 
; COUNTRY: USA 

ZIP: 02109 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: AscII (text) 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/525, 864A 

FILING DATE: 8-SEP-1995 
; CLASSIFICATION: 530 

ATTORNEY/AGENT INFORMATION: 

NAME: Kara, Catherine J, 

REGISTRATION NUMBER: 41,106 

REFERENCE/ DOCKET NUMBER: HUI-017 



TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617)227-7400 
TELEFAX: (617)742-4214 
; INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1207 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
; MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: CDS 
LOCATION: 2.. 394 
US-08-525-864A-5 



Query Match 24.1%; Score 216.2; DB 2; Length 1207; 

Best Local Similarity 86.3%; Pred. No. 2.7e-45; 

Matches 239; Conservative 0; Mismatches 38; Indels 0; Gaps 0; 



Qy 


597 


AAAGAACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTG 
1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 { 1 1 1 1 Mill 1 1 1 1 1 
AAAGAACTCACGGCTACAGTTCAACAAAGTGAAGGTGGAGGACGCTGGAGAGTACGTCTG 


656 


Db 


1 


60 


Qy 


657 


CGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGT 
1 1 1 N 1 1 1 II 1 1 1 II 1 II M 1 1 M 1 1 1 1 1 1 1 1 II 1 II 1 1 1 II 1 1 1 1 M 1 1 
TGAGGCTGAGAACATCCTTGGGAAGGACACTGTGAGGGGCCGGCTCCATGTCAACAGTGT 


716 


Db 


61 


120 


Qy 


717 


GAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGG7\AGTGCAACGAGACAGCCAAGTC 

N 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 II 1 M 1 1 1 1 II M 

GAGCACCACTCTGTCGTCCTGGTCGGGGCACGCCCGGAAGTGCAATGAGACAGCCAAGTC 


776 


Db 


121 


180 


Qy 


777 


CTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAA 

III II II 1 M 1 1 1 1 1 1 II 1 M 1 II 1 1 1 1 1 M 1 II 1 1 II II 1 1 1 II 1 M 1 

CTACTGTGTGAATGGAGGCGTGTGCTACTACATCGTyVGGCATCAACCAACTCTCCTGCAA 


836 


Db 


,181 


240 


Qy 


837 


GTGTCCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

1 1 1 1 1 Mill III 1 1 1 1 1 1 III 
ATGTCCAAACGGATTCTTCGGACAGAGATGTTTGGAG 277 




Db 


241 





RESULT 14 
US-08-036-555B-21 

; Sequence 21^ Application US/08036555B 
; Patent No. 5530109 

GENERAL INFORMATION: 

APPLICANT: Goodearl, Andrew; Stroobant, Paul; 
; APPLIC7VNT: Minghetti, Luisa; Waterfield, Michael; Marchioni, Mark; 

APPLICiWT: Chen, Maio Su; Hiles, Ian 
; TITLE OF INVENTION: Glial Mitogenic Factors, Their 
TITLE OF INVENTION: Preparation and Use 
NUMBER OF SEQUENCES: 184 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Felfe & Lynch 
; STREET: 8 05 Third Avenue 

CITY: New York City 
STATE: New York 
COUNTRY: USA 



ZIP: 10022 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 5.25 inch, 360 kb storage 
; COMPUTER: IBM 

OPERATING SYSTEM: PC-DOS 

SOFTWARE: Wordperfect 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/036, 555B 

FILING DATE: 24-MAR-1993 

CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/965,173 

FILING DATE: 23-OCT-1992 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/940,389 

FILING DATE: 03-SEP-1992 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/907,138 

FILING DATE: 30-JUN-1992 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/863,703 

FILING DATE: 03-APRIL-1992 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: U.K. 91 07566.3 

FILING DATE: lO-APRIL-1991 
; ATTORNEY/AGENT INFORMATION: 

NAME: Tsai, Christine H. 
; REGISTRATION NUMBER: 34,266 

; REFERENCE/ DOCKET NUMBER: LUD 5250.4 

TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (212) 688-9200 

TELEFAX: (212) 838-3884 
; INFORMATION FOR SEQ ID NO: 21: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 2003 

TYPE: nucleic acid 

STRANDEDNESS: single 
TOPOLOGY: linear 
; FEATURE : 

OTHER INFORMATION: N in positions 31 and 32 could be either 
OTHER INFORMATION: A or G. 
US-08-036-555B-21 



Query Match 9.4%; Score 84; DB 1; Length 2003; 

Best Local Similarity 49.0%; Pred. No. 6.9e-12; 

Matches 361; Conservative 0; Mismatches 345; Indels 30; Gaps 4; 

GGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGGGGGCTGCA 248 

" Ml II I I III MM I I M I I III 

Db 7 62 GGTGTGGGCGGTGAAAGCCGGGGGCTTGAAGAAGGACTCGCTGCTCACCGTGCGCCTGGG 821 

Qy 24 9 GCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAGCGCTACAT 308 

I I I III MM Ml III MINI 

Db 822 GACCTGGGGCCACCCCGCCTTCCCCTCCTGCGGGAGGCTCAAGGAGGACAGCAGGTACAT 881 

Qy 309 CTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACG GCCTT 356 

I M M I I M I I I I III II II 



Db 



882 CTTCTTCATGGAGCCCGACGCCAACAGCACCAGCCGCGCGCCGGCCGCCTTCCGAGCCTC 941 



Qy 357 TGCCCCCCTCGATACCAACGGC7VAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCAC 416 

I I I I I I I III I I I I I I I M I II I I I II I I I I I I I I I 

Db 942 TTTCCCCCCTCTGGAGACGGGCCGGAACCTCAAGAAGGAGGTCAGCCGGGTGCTGTGCAA 1001 

Qy 417 TGACTGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGA 476 

I I I M I I I I I I I II II I I I I I I I I I I I II I III 

Db 1002 GCGGTGCGCCTTGCCTCCCCAATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTC 1061 

Qy 477 GAAGCAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTT 536 

I I I I II I I I I I I I II III I II I I II I 

Db 1062 CAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATTCAAGTGGTT 1121 

Qy 537 CAAGGATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCA TCAAATATGGCAACGG 593 

I I I I I I I I I I I I I I I II I I I I MM II 

Db 1122 CAAGAATGGGAATGAATTGAATCGAAAAAACTVAACCACAAAATATCAAGATACAAAAAAA 1181 

Qy 594 CAGAAAGT^CTCACGACTACAGTTCAACAAGGTGTVAGGTGGAGGACGCTGGGGAGTATGT 653 

I MINI II I I I I I I I I I III II I I I I I I I I I I I 

Db 1182 GCCAGGGAAGTCAGAACTTCGCATTAACAAAGCATCACTGGCTGATTCTGGAGAGTATAT 1241 

Qy 654 CTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAG 713 

Mill III I I I I I I I I I II II I III 

Db 1242 GTGCAAAGTGATCAGCAAATTAGGAAATGACAGTGCCTCTGCCAATATCACCATCGTGGA 1301 

Qy 714 CGTGAGCACCACCCTGTCATCC TGGTCGGGGCACGCCCGGAAGTGCAACGAGAC 767 

I I I I I I I I II I I I I I I I I I I I I I I 

Db 1302 ATCAAACGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAA 1361 

Qy 768 AGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCT 827 

I II I I I I I I II I I I I I I I I I I I I I I I I I II 

Db 1362 GGAG7\AAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTTCA7^CCC 1421 

Qy 82 8 CTC CTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGC 878 

III 1 1 1 1 1 1 1 1 1 1 I I 1 1 M 1 1 I 1 1 I r I I I I 

Db 1422 CTCGAGATACTTGTGCAAGTGCCCAAATGAGTTTACTGGTGATCGCTGCCAAAACTACGT 14 81 

Qy 879 AATGGTC7\ACTTCTCC 8 94 

I I I I I I I I I I I I I 
Db 14 82 AATGGCCAGCTTCTAC 14 97 



RESULT 15 
US-08-469-569-21 

; Sequence 21, Application US/08469569 

; Patent No. 5606032 

; GENERAL INFORMATION: 

; APPLICANT: Goodearl, Andrew; Stroobant, Paul; 

; APPLICANT: Minghetti, Luisa; Waterfield, Michael; Marchioni, Mark; 

APPLICANT: Chen, Maio Su; Hiles, Ian 
; TITLE OF INVENTION: Glial Mitogenic Factors, Their 
; TITLE OF INVENTION: Preparation and Use 
; NUMBER OF SEQUENCES: 184 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Felfe & Lynch 
; STREET: 8 05 Third Avenue 



CITY: New York City 

STATE: New York 

COUNTRY: USA 
; ZIP: 10022 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 5.25 inch, 360 kb storage 

COMPUTER: IBM 

OPERATING SYSTEM: PC-DOS 

SOFTWARE: Wordperfect 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/469,569 

FILING DATE: 06-JUN-1995 

CLASSIFICATION: 530 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 08/036,555 

FILING DATE: 24-MAR-1993 

APPLICATION NUMBER: 07/965,173 
; FILING DATE: 23-OCT-1992 

; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/940,389 

FILING DATE: 03-SEP-1992 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/907,138 

FILING DATE: 30-JUN-1992 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/863,703 

FILING DATE: 03-APRIL-1992 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: U.K. 91 07566.3 

FILING DATE: lO-APRIL-1991 
ATTORNEY/AGENT INFORMATION: 

NAME: Tsai, Christine H. 
; REGISTRATION NUMBER: 34,266 

; REFERENCE/DOCKET NUMBER: LUD 5250.4 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: (212) 688-9200 

TELEFT^: (212) 838-3884 
; INFORMATION FOR SEQ ID NO: 21: 
SEQUENCE CH7VRACTERISTICS: 

LENGTH: 2 003 

TYPE: nucleic acid 

STRANDEDNESS: single 
; TOPOLOGY: linear 

FEATURE : 

OTHER INFORMATION: N in positions 31 and 32 could be either 
OTHER INFORMATION: A or G. 
US-08-469-569-21 



Query Match 9.4%; Score 84; DB 1; Length 2003; 

Best Local Similarity 49.0%; Pred. No. 6.9e-12; 

Matches 361; Conservative 0; Mismatches 345; Indels 30; Gaps 4; 

Qy 189 GGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGGGGGCTGCA 24 8 

II I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 762 GGTGTGGGCGGTGAAAGCCGGGGGCTTGAAGAAGGACTCGCTGCTCACCGTGCGCCTGGG 821 



Qy 24 9 GCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAGCGCTACAT 308 



822 GACCTGGGGCCACCCCGCCTTCCCCTCCTGCGGGAGGCTCAAGGAGGACAGCAGGTACAT 8 81 

Qy 30 9 CTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACG GCCTT 356 

M I I I I I II I I I I IN I I I I 

882 CTTCTTCATGGAGCCCGACGCCAACAGCACCAGCCGCGCGCCGGCCGCCTTCCGAGCCTC 941 

Qy 357 TGCCCCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCAC 416 

I I I I I I I I II II I II II II I I II II II I I I I I I II I 

Db 942 TTTCCCCCCTCTGGAGACGGGCCGGAACCTCAAGAAGGAGGTCAGCCGGGTGCTGTGCT^ 1001 

Qy 417 TGACTGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGA 476 

MINI I I I I I I I I I I I I I I I I I II I I II I Ml 

Db 1002 GCGGTGCGCCTTGCCTCCCCAATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTC 1061 

Qy 47 7 GAAGCAATCGCTGT^GTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTT 536 

I I I I M I I I I I I I II III I II I I I II 

Db 1062 CAAACTAGTCCTTCGGTGTGA7VACCAGTTCTGAATACTCCTCTCTCAGATTCAAGTGGTT 1121 

Qy 537 CAAGGATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCA TCAJ^ATATGGCAACGG 593 

I I II I I I I I I I I MINI I I I I II I II 

Db 1122 CAAGAATGGGAATGAATTGAATCGAAAAAACAAACCACAAAATATCAAGATACAAAAAAA 1181 

Qy 594 CAGTW^GAACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGT 653 

I I I I I I I MINI III II I II I II I I I I I 

Db 1182 GCCAGGGAAGTCAGAACTTCGCATTAACAAAGCATCACTGGCTGATTCTGGAGAGTATAT 1241 

Qy 654 CTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAG 713 

I I I I I III I I I I I I I I I I I II I III 

Db 1242 GTGCAAAGTGATCAGCAAATTAGGAAATGACAGTGCCTCTGCCT^TATCACCATCGTGGA 1301 

Qy 714 CGTGAGCACCACCCTGTCATCC TGGTCGGGGCACGCCCGGAAGTGCAACGAGAC 7 67 

I I I I I I I I II I I I I I I I I II I I II 

Db 1302 ATCAAACGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAA 1361 

Qy 768 AGCCAAGTCCTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCT 827 

I II I I I I I I I I I I I I M I I I I I I I I I I I II 

Db 1362 GGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGTW^GACCTTTCAAACCC 1421 

Qy 828 CTC CTGCAAGTGTCCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGC 878 

III I I I I I II I I I I I M I I I I I I I I I I I II 

Db 1422 CTCGAGATACTTGTGCAAGTGCCCAAATGAGTTTACTGGTGATCGCTGCCAAAACTACGT 1481 

Qy 87 9 AATGGTCAACTTCTCC 894 

I I I I I II I I I II I 
Db 1482 AATGGCCAGCTTCTAC 14 97 
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Sequence : 

Scoring table: 



1 atgaggcgcgacccggcccc caatggtcaacttctcctaa 897 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
US-09-864-675-3 

; Sequence 3, Application US/09864675 
; Patent No. US20020081286A1 
; GENERAL INFORMATION: 



; APPLICANT: Marchionni, Mark 

; TITLE OF INVENTION: NRG-2 NUCLEIC ACID MOLECULES, 

; TITLE OF INVENTION: POLYPEPTIDES, AND DIAGNOSTIC AND THERAPEUTIC METHODS 

; FILE REFERENCE: 04585/049002 

; CURRENT APPLICATION NUMBER: US/ 09/ 8 64 , 675 

; CURRENT FILING DATE: 2001-05-23 

; PRIOR APPLICATION NUMBER: US 60/206,495 

; PRIOR FILING DATE: 2000-05-23 

; NUMBER OF SEQ ID NOS : 18 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 3 

LENGTH: 8 97 
; TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-864-675-3 

Query Match 100.0%; Score 897; DB 9; Length 897; 

Best Local Similarity 100.0%; Pred. No. 2.2e-244; 

Matches 8 97; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I M I I I I I I I I M M I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I 

Db 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I M M I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I M I I I I I I I I I I 

Db 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

Qy 121 GGCT^AGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 

I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

M I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 30 0 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M 
Db 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

Qy 3 61 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I M I I I I I I I 
Db 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

Qy 421 TGCGCCACCCGGCCCAAGTTGT^GAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 48 0 

I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I { I I I I I I I M I I I I 
Db 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 48 0 

Qy 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTC7VAG 54 0 

M I I I M M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I 

Db 481 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTT^TCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 



Qy 



541 GATGGCAAGGAGCTC7UVCCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 



Db 


541 


GAT G G C AAGGAG CT CAAC C G C AG C C GAGAC ATT r r AT r A A AT A T n r a A r r^(^ r A r: a a 


D U U 


Qy 


601 


AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 


660 






1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


AAC T CAC GACT AC AGT T CAACAAGGT GAAG GT G A (^r; A r r T r; r: r: A r T A T r T r r" r n r 


D O U 


Qy 


661 


GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 


720 






1 M 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


GC C GAGAAC AT P r T GG GGAAG G AC AC r GT r P fi(^(^nr r r; riP T T T A PHT r* A a r* A p r r T r n r* 

vj^^ojT.o.rxiT.^j^x Vw.^ X oooonjT.oo./T.^.r\w^o X ^v^oooo^L^oo^ X X X J~\\^ o 1 V-./\r\L-.rt, w o X orAo 


Ton 
/ z u 


Qy 


721 


ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 


780 






1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 




Db 


721 


APPAPPPTGTP ATPPTGGTPGGGGPAPGPPPriPA A^iTr^P A APr^APAPamnaarTrT'TflT' 

\^r\\^ \^\^ ± \j X v^^TT. J. ^ -L oo X o o o o o*^^ ^ ooAVrto X o\--/Art.V_, o/\o/\\^ /\o L-jn/A-o X L-L^ i. /\ J. 


"7 Q ri 
/ o u 


Ov 


781 


Tf^Pf^TP AATf^PAPPPriTPT (^PT APT a PAT'Pr' a P^rr'aTr' a 7^r'r*Ts.r*r"vr'''vr'r'^r'm\ a /"Ti/^rp 
X o^oX L^/-\-MX o^jAioo^oX ^X oL- X-rt.\jX/\\^/\X k>or\ooo\^/\i X L-1 L-L-l oU/\A.ol oi 


o 4 U 






1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


781 


TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 


840 


Qy 


841 


CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCTAA 897 








1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 M 1 1 1 1 M 1 M 1 1 1 




Db 


841 


CCTGTGGGATACACCGGGGACAGGTGTCAGCAGTTCGCAATGGTCAACTTCTCCTAA 897 





RESULT 2 
US-09-864-675-1 

; Sequence 1, Application US/09864675 

; Patent No. US20020081286A1 

; GENERAL INFORMATION: 

; APPLICANT: Marchionni^ Mark 

; TITLE OF INVENTION: NRG-2 NUCLEIC ACID MOLECULES, 

; TITLE OF INVENTION: POLYPEPTIDES, AND DIAGNOSTIC AND THERAPEUTIC METHODS 

; FILE REFERENCE: 04585/049002 

; CURRENT APPLICATION NUMBER: US/ 09/ 8 64 , 67 5 

; CURRENT FILING DATE: 2001-05-23 

; PRIOR APPLICATION NUMBER: US 60/206,495 

; PRIOR FILING DATE: 2000-05-23 

; NUMBER OF SEQ ID NOS : 18 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 1 

LENGTH: 994 
; TYPE: DNA 
; ORGANISM: Homo sapiens 
US-09-864-675-1 



Query Match 94.6%; Score 849; DB 9; Length 994; 

Best Local Similarity 98.3%; Pred. No. 9.7e-231; 

Matches 858; Conservative 0; Mismatches 15; Indels 0; Gaps 0; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 



121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

I I I I M M I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I M I M I I I M I I I I I I 

121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 
Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I I N I I I I I I I M I M I I I I I I I I I I I I I I I M M I I M I I I I I I I M M I I I I M I I 

Db 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I M M I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

Db 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGi^AAGGAACCAG 300 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGT^ACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I M I M M I I I I M I I I I M I I I I M I I I I I I I M I I I I I I I I I I { I I I I M I I I I I I I I 

Db 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCT^GATCCTGTGCACTGAC 42 0 

I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 361 CCCCTCGATACCT^CGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 48 0 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I 
Db 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 480 

Qy 481 CT^TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 4 81 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 54 0 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

M M I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I M I I I I I I 

Db 541 GATGGCiyVGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

Qy 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I M I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

I M M I I I M I I M I I I I I I M M I I I I I I I I I I I I M I I I I I I I I I I I I I I M 

Db 661 GCCGAGAACATCCTGGGGTyVGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 72 0 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGG7UVGTGCAACGAGACAGCCAAGTCCTAT 78 0 

Qy 781 TGCGTC7VATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

I I I I I I I I M M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 84 0 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II I I I I I III I I I III III 

Db 841 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 873 

RESULT 3 
US-10-096-241-5 

; Sequence 5, Application US/10096241 
; Publication No. US2 0020127594A1 



GENERAL INFORMATION: 
; APPLICANT: Gearing, David P. 

; Busfield, Samantha J. 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 

AND USES THEREFOR 
NUMBER OF SEQUENCES: 33 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 
; STREET: 225 Franklin Street 

CITY: Boston 
STATE: MA 
COUNTRY: US 
ZIP: 02110-2804 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ Version 2.0 

; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/096, 241 
FILING DATE: 12-Mar-2002 
; CLASSIFICATION: <Unknown> 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/699,591 
FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,983 
REFERENCE/DOCKET NUMBER: 07334/022001 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617-542-5070 
TELEFAX: 617-542-8906 
; TELEX: <Unknown> 

INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1884 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: Coding Sequence 
LOCATION: 664... 1883 
; OTHER INFORMATION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-10-096-241-5 

Query Match 93.1%; Score 835.4; DB 14; Length 1884; 

Best Local Similarity 98.1%; Pred. No. 8.2e-227; 

Matches 856; Conservative 0; Mismatches 16; Indels 1; Gaps 1; 
Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I i M I I I I I I I I I I M M I I I I I M I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I M I 

Db 218 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 277 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 120 

I I I I M I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 



278 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 337 
Qy 121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 180 

M I I M I M I I I I I I I I I I I I I I I I I I I I I M M I I I I M I M I I I I Ill 

Db 338 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 397 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I II I I I M I I I I M I II I I I I I II II I II I I I I I I I I I I I I I 

Db 398 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGAC7VAGTGGCCGCTCCGGAGCGGG 457 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

M I I I I I I I M I I I I I II I I I M I I I I M I I I I I I I I II I II I I I I I I I I I I II I I M I I 

Db 458 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 517 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I M I I I II I I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I M I I 
Db 518 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 577 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 420 

I I I M I I M I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I II I I I M I 

Db 578 CCCCT-GATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGGC 636 

Qy 421 TGCGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAG 4 80 

Mini I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I 

Db 637 TGCGCCACCCGGCCCAAGTTGAAGAAGATGJDAGAGCCAGACGGGACAGGTGGGTGAGAAG 696 

Qy 4 81 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 540 

I I I I I I I M I I I I M I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I M M I I I I I I I I I 
Db 697 CAATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAG 756 

Qy 541 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 600 

I M I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 757 GATGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAG 816 

Qy 601 AACTCACGACTACAGTTCAACTy^GGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 660 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I M I I I I I I I I I I 
Db 817 AACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAG 876 

Qy 661 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 720 

M I I I I II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 877 GCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGC 936 

Qy 721 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 78 0 

I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I M 

Db 937 ACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTAT 996 

Qy 781 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGT 840 

I M M I I M M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I II I III 

Db 997 TGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGT 1056 

Qy 841 CCTGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

II I I I I I III I I I I I I III 

Db 1057 CCAAATGGATTCTTCGGACAGAGATGTTTGGAG 1089 



RESULT 4 
US-10-096-241-7 



; Sequence 7, Application US/10096241 
; Publication No. US2002 0127594A1 

GENERAL INFORMATION: 
; APPLICANT: Gearing, David P. 

; Bus field, Samantha J. 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 

AND USES THEREFOR 
; NUMBER OF SEQUENCES: 33 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 
STREET: 225 Franklin Street 
CITY: Boston 
STATE: MA 
COUNTRY: US 
ZIP: 02110-2804 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/096, 241 
; FILING DATE: 12-Mar-2002 

; CLASSIFICATION: <Unknown> 

PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 08/699,591 

FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,983 
; REFERENCE/DOCKET NUMBER: 07334/022001 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617-542-5070 
; TELEFAX: 617-542-8906 

TELEX: <Unknown> 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1476 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single 
; TOPOLOGY: linear 

MOLECULE TYPE: cDNA 
FEATURE : 

NAME/KEY: Coding Sequence 
LOCATION: 69... 14 7 5 
OTHER INFORMATION: 
; SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

US-10-096-241-7 

Query Match 47.7%; Score 427.8; DB 14; Length 1476; 

Best Local Similarity 89.8%; Pred. No. 3.9e-lll; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0 

Qy 363 CCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 422 

M I I I M III I I II I I I I M I I M 

Db 98 CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 157 



Qy 

Db 


423 
158 


CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 
1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 


482 
217 


Qy 


483 


ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 
M M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 
ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 


542 


Db 


218 


277 


Qy 

Db 


543 
278 


TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 

1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 { 1 1 M M 
TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 


602 
337 


Qy 


603 


CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 
M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 


662 


Db 


338 


397 


Qy 


663 


CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 

1 M 1 1 1 1 1 1 1 M 1 M I 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 


722 


Db 


398 


457 


Qy 

Db 


723 
458 


CACCCTGTCATCCTGGTCGGGGCACGCCCGGT^GTGCAACGAGACAGCCAAGTCCTATTG 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 M 1 M 1 1 1 1 

CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 


782 
517 


Qy 


783 


CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 t i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 t 1 t 1 1 1 1 1 1 1 

1 1 M M 1 M 1 i 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 M 1 M 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 


842 


Db 


518 


577 


Qy 


843 


TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

1 1 1 1 1 III 1 1 1 1 II III 
AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 




Db 


578 





RESULT 5 

US-10-096-241-31 

; Sequence 31, Application US/10096241 

; Publication No. US20020127594A1 

; GENERAL INFORMATION: 

; APPLICANT: Gearing, David P. 

; Bus field, Samantha J. 

; TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 

; AND USES THEREFOR 

NUMBER OF SEQUENCES: 33 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 
; STREET: 225 Franklin Street 

CITY: Boston 

STATE: MA 

COUNTRY: US 
; ZIP: 02110-2804 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 
; OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/096,241 



FILING DATE: 12-Mar-2002 
; CLASSIFICATION: <Unknown> 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/699,591 
FILING DATE: 19-AUG-1996 
; ATTORNEY/AGENT INFORMATION: 

; N7\ME: Fasse, J. Peter 

; REGISTRATION NUMBER: 32,983 

REFERENCE/DOCKET NUMBER: 07334/022001 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-542-5070 
; TELEFAX: 617-542-8 906 

TELEX: <Unknown> 
INFORMATION FOR SEQ ID NO: 31: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 2268 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

; NAME/KEY: Coding Sequence 

LOCATION: 69... 2009 

OTHER INFORMATION: 
SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
US-10-096-241-31 



Query Match 47.7%; Score 427.8; DB 14; Length 2268; 

Best Local Similarity 89.8%; Pred. No. 4.3e-lll; 

Matches 459; Conservative 0; Mismatches 52; Indels 0; Gaps 0; 



Qy 


363 


CCTCGATACCT^ACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTG 
1 1 M 1 1 I III 1 III 1 1 1 1 1 1 1 1 II 
CCGCGGCAAGAAGCACCCAGAGGGGAGGAAGCGGGAGAGGGAGCCCGATCCCGGGGAGAA 


422 


Db 


98 


157 


Qy 


423 


CGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCA 
1 1 1 M M 1 1 II 1 1 II 1 II 1 1 1 1 1 II M 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 M 1 1 
AGCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGT^AGCA 


482 


Db 


158 


217 


Qy 


483 


ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 
1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 
ATCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGA 


542 


Db 


218 


277 


Qy 


543 


TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 
M 1 1 II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 
TGGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAA 


602 


Db 


278 


337 


Qy 


603 


CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 

1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 M M M 1 1 1 M 
CTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGC 


662 


Db 


338 


397 


Qy 


663 


CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 
M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 M M 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 
CGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCAC 


722 


Db 


398 


457 


Qy 


723 


CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 


782 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I M I 



Db 



458 CACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTG 517 



Qy 7 83 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCT^CCAGCTCTCCTGCAAGTGTCC 842 

M I I I I I I I I I M M I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M 

Db 518 CGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCC 577 

Qy 843 TGTGGGATACACCGGGGACAGGTGTCAGCAG 873 

I M I I III I II II I III 
Db 578 AAATGGATTCTTCGGACAGAGATGTTTGGAG 608 



RESULT 6 
US-10-271-416-6 

Sequence 6, Application US/10271416 
Publication No. US20040043021A1 
GENERAL INFORMATION: 
APPLICANT: Keith, Tim 
APPLICANT: Little, Randall D. 
APPLICANT: Van Eerdewegh, Paul 
APPLICANT: Dupuis, Josee 
APPLICANT: Del Mastro, Richard G. 
APPLICANT: Allen, Kristina 

TITLE OF INVENTION: NUCLEOTIDE AND AMINO ACID SEQUENCES 
TITLE OF INVENTION: RELATING TO RESPIRATORY DISEASES AND OBESITY 
FILE REFERENCE: 2976-4045 

CURRENT APPLICATION NUMBER: US/10/271, 416 
CURRENT FILING DATE: 2002-10-11 
PRIOR APPLICATION NUMBER: 60/328,424 
PRIOR FILING DATE: 2001-10-11 
NUMBER OF SEQ ID NOS : 9 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 6 

LENGTH: 22 693 
TYPE: DNA 

ORGANISM: Homo sapien 
US-10-271-416-6 

Query Match 47.4%; Score 424.8; DB 13; Length 22693; 

Best Local Similarity 94.2%; Pred. No. 5.3e-110; 

Matches 452; Conservative 0; Mismatches 27; Indels 1; Gaps 1; 

ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 
M I I I II I I I II I I I I I I I I II I I I I M I I I I II I I I I I I I I I I II I I I I I II I M I II I 
ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 20868 

TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 
I M I I I I I I I I I I I I II I I I I II I I I I I I I I I I I II II I I I I I I I I I I I I M I I I I I I I I 
TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 20928 

GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 
I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I II II M I I I I II I I I I I I I M I I 
GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 2 0988 

CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I I M I M I I I I II M II I I I I II I I I I II I I M I I I M I I I M I I I I I I I II I I I I I II 

CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 2104 8 



Qy 


1 


Db 


20809 


Qy 


61 


Db 


20869 


Qy 


121 


Db 


20929 


Qy 


181 


Db 


20989 



Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I I I I M I I I I I I I I I I M I M M I M I I I M I M I I I I I M I I I I I M I I I I I I I I I I I 

Db 21049 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 21108 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I 
Db 21109 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 21168 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

I I I I I I I I I I I I I M I I I M I I I I I I I I I I M I I M I I M I I I I I I I I I I I I I I I I I I I I 

Db 21169 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 21228 

Qy 421 TGC-GCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAA 479 

I I I I I Mill I I III I I I I I I I I I I I I I I I M 

Db 2122 9 TGCGGTGAGTCGCCCCCTCCCTTTGCTGGAGAAAGGGGGGAGGGGCGAGGTGGTGGAGAA 2128 8 



RESULT 7 

US-10-447-839A-10 

; Sequence 10, Application US/10447839A 

; Publication No. US20040018181A1 

; GENERAL INFORMATION: 

; APPLICANT: Kufe, Donald W. 

; APPLICANT: Kharbanda, Surender 

; APPLICT^T: Weitman, Steven D. 

; TITLE OF INVENTION: MUCl INTERFERENCE RNA COMPOSITIONS AND METHODS DERIVED 
THEREFROM 

; FILE REFERENCE: 1000.05,009 

; CURRENT APPLICATION NUMBER: US/10/447, 839A 

CURRENT FILING DATE: 2003-05-29 
; PRIOR APPLICATION NUMBER: 10/293,391 
; PRIOR FILING DATE: 2002-11-13 

PRIOR APPLICATION NUMBER: 09/951,938 
; PRIOR FILING DATE: 2001-09-11 
; PRIOR APPLICATION NUMBER: 60/231,841 
; PRIOR FILING DATE: 2000-09-11 
; NUMBER OF SEQ ID NOS : 109 
; SOFTWARE: Patentin version 3.2 
; SEQ ID NO 10 
; LENGTH: 1054 

TYPE: DNA 
; ORGANISM: Homo Sapien 
US-10-447-839A-10 

Query Match 47.3%; Score 424; DB 16; Length 1054; 

Best Local Similarity 100.0%; Pred. No. 4.3e-110; 

Matches 424; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 60 

I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I II I I I I I I I I I I I I I I M I I I I I I I I I 
Db 589 ATGAGGCGCGACCCGGCCCCCGGCTTCTCCATGCTGCTCTTCGGTGTGTCGCTCGCCTGC 648 

Qy 61 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 12 0 

I I I M I I II II I I I I I I II M II I II I II I I I I I I I I I I I I I I I I I 

Db 649 TACTCGCCCAGCCTCAAGTCAGTGCAGGACCAGGCGTACAAGGCACCCGTGGTGGTGGAG 708 



Qy 



121 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 18 0 



Db 7 09 GGCAAGGTACAGGGGCTGGTCCCAGCCGGCGGCTCCAGCTCCAACAGCACCCGAGAGCCG 7 68 

Qy 181 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 24 0 

I I M I I I M I I I M I M I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

Db 769 CCCGCCTCGGGTCGGGTGGCGTTGGTAAAGGTGCTGGACAAGTGGCCGCTCCGGAGCGGG 82 8 

Qy 241 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 300 

I I M M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 82 9 GGGCTGCAGCGCGAGCAGGTGATCAGCGTGGGCTCCTGTGTGCCGCTCGAAAGGAACCAG 8 88 

Qy 301 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 360 

I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I 
Db 8 89 CGCTACATCTTTTTCCTGGAGCCCACGGAACAGCCCTTAGTCTTTAAGACGGCCTTTGCC 948 

Qy 361 CCCCTCGATACCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 949 CCCCTCGATACCAACGGCATW^TCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGAC 1008 

Qy 421 TGCG 424 

I I I I 

Db 1009 TGCG 1012 



RESULT 8 
US-10-096-241-3 

; Sequence 3, Application US/10096241 

; Publication No. US20020127594A1 

; GENERAL INFORMATION: 

; APPLICANT: Gearing, David P. 

; Bus field, Samantha J. 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 

AND USES THEREFOR 
; NUMBER OF SEQUENCES: 33 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Fish & Richardson P.C. 

STREET: 225 Franklin Street 

CITY: Boston 

STATE: MA 

COUNTRY: US 
; ZIP: 02110-2804 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/10/096, 241 

; FILING DATE: 12-Mar-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/699,591 

FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,983 

REFERENCE/ DOCKET NUMBER: 07334/022001 



TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617-542-5070 
; TELEFAX: 617-542-8906 

TELEX: <Unknown> 
; INFORMATION FOR SEQ ID NO: 3: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 1607 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
; MOLECULE TYPE: cDNA 

; FEATURE : 

NAME/KEY: Coding Sequence 

LOCATION: 7 9... 621 
; OTHER INFORMATION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
US-10-096-241-3 

Query Match 45.2%; Score 4 05.4; DB 14; Length 1607; 

Best Local Similarity 87.9%; Pred. No. 9e-105; 

Matches 442; Conservative 0; Mismatches 61; Indels 0; Gaps 0; 

Qy 371 CCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 430 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I 
Db 2 CTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 61 

Qy 431 GGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGA 490 

I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I II M I I M I I I I I II I 
Db 62 GGCCCAAGCTGAAGAAGATGAAGAGCCAGACAGGAGAGGTGGGTGAGAAGCAGTCGCTCA 121 

Qy 4 91 AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGG 550 

I II M I I I I I I I I II II II I I I I I I I I Mill M I I II I M I I I I I I I I II II 
Db 122 AGTGTGAGGCAGCGGCGGGAAACCCCCAGCCCTCCTATCGCTGGTTCAAGGATGGCAAGG 181 

Qy 551 AGCTCT^CCGCAGCCGAGACATTCGCATCT^AATATGGCAACGGCAGAAAGAACTCACGAC 610 

I I I I I I I I I II II II I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I M I 
Db 182 AACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGCAATGTCAGAAAGAACTCACGGC 241 

Qy 611 TACAGTTCAACAAGGTGT^GGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACA 670 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I II M I 
Db 242 TACAGTTC7\ACAAAGTGAGGGTGGAGGATGCCGGGGAGTACGTCTGTGAGGCCGAGAACA 301 

Qy 671 TCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGT 730 

I I I I I I I I I I M I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 302 TCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTCT^CAGCGTGAGCACCACTCTGT 361 

Qy 731 CATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATG 790 

I I I I I II M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I 
Db 362 CATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGCCAAGTCCTACTGTGTGAATG 421 

Qy 791 GAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGAT 850 

I II I M I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I I 

Db 422 GAGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGC/AATGTCCAAACGGAT 4 81 

Qy 851 ACACCGGGGACAGGTGTCAGCAG 873 

I III I I I II I III 
Db 482 TCTTCGGACAGAGATGTTTGGAG 504 



RESULT 9 
US-10-096-241-1 

; Sequence 1, Application US/10096241 
; Publication No. US20020127594A1 
GENERAL INFORMATION: 

APPLICANT: Gearing, David P. 
; Busfield, Samantha J. 

TITLE OF INVENTION: DON-1 GENE AND POLYPEPTIDES 
; AND USES THEREFOR 

NUMBER OF SEQUENCES: 33 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.O. 
STREET: 225 Franklin Street 
CITY: Boston 
STATE: MA 
COUNTRY: US 
; ZIP: 02110-2804 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/096,241 
; FILING DATE: 12-Mar-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 08/699,591 

FILING DATE: 19-AUG-1996 
ATTORNEY/AGENT INFORMATION: 
; NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,983 
REFERENCE/DOCKET NUMBER: 07334/022001 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-542-5070 
TELEFAX: 617-542-8906 
; TELEX: <Unknown> 

; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 2467 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 
; TOPOLOGY: circular 

MOLECULE TYPE: cDNA 
; FEATURE : 

; N7\ME/KEY: Coding Sequence 

LOCATION: 7 9... 18 93 

OTHER INFORMATION: 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-10-096-241-1 

Query Match 45.2%; Score 405.4; DB 14; Length 2467; 

Best Local Similarity 87.9%; Pred. No. le-104; 

Matches 442; Conservative 0; Mismatches 61; Indels 0; Gaps 0; 



Qy 371 CCAACGGCAAAAATCTCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 4 30 

I M I M I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 CTAACGGCAAAAACATCAAGAAAGAGGTGGGCAAGATCCTGTGCACTGACTGCGCCACCC 61 

Qy 431 GGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAATCGCTGA 4 90 

I I I I I I M MM I M M M M M I Ml M M M M M M M M M M I I 

Db 62 GGCCCAAGCTGAAGAAGATGAAGAGCCAGACAGGAGAGGTGGGTGAGAAGCAGTCGCTCA 121 

Qy 491 AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGATGGCAAGG 550 

I M M M M M M M M I I M M M M M M I M M M M M M M M M M I 

Db 122 AGTGTGAGGCAGCGGCGGGAAACCCCCAGCCCTCCTATCGCTGGTTCAAGGATGGCAAGG 181 

Qy 551 AGCTCi^ACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAGAAAGAACTCACGAC 610 

I M M M M M M M M M M M M I I M M M I I M M M M M M M I I 

Db 182 AACTCAACCGGAGTCGTGATATTCGCATCAAGTATGGCAATGTCAGAAAGAACTCACGGC 241 

Qy 611 TACAGTTCTyvCAAGGTGTVAGGTGGAGGACGCTGGGGAGTATGTCTGCGAGGCCGAGAACA 670 

M M M I M Ml M M M M I M I M M I M M I M M M M 

Db 242 TACAGTTCAACAAAGTGAGGGTGGAGGATGCCGGGGAGTACGTCTGTGAGGCCGAGAACA 3 01 

Qy 671 TCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAGCGTGAGCACCACCCTGT 730 

MM I M M M M M M I M M M I M I M M M M M M M M M M MM 

Db 302 TCCTTGGGAAGGACACCGTGAGGGGCCGACTCCATGTCAACAGCGTGAGCACCACTCTGT 361 

Qy 731 CATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAGTCCTATTGCGTCAATG 7 90 

M M M M M M I M M M M M M M M M M I M M M M M I I I M MM 

Db 362 CATCCTGGTCGGGACATGCCCGGAAGTGCAATGAGACCGCCAAGTCCTACTGTGTGAATG 421 

Qy 791 GAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAGTGTCCTGTGGGAT 850 

M M M I M M M M M M M M M M M M M I M M I M I M I M M I MM 

Db 422 GAGGCGTGTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGCAAATGTCCAAACGGAT 4 81 

Qy 851 ACACCGGGGACAGGTGTCAGCAG 8 73 

I Ml I M M I Ml 
Db 482 TCTTCGGACAGAGATGTTTGGAG 504 



RESULT 10 

US-10-029-386-26613 

; Sequence 26613;. Application US/10029386 

; Publication No. US20030194704A1 

; GENERAL INFORMATION: 

; APPLICANT: Penn, Sharron G. 

; APPLICANT: Rank, David R. 

; APPLICANT: Hanzel, David K, 

; TITLE OF INVENTION: HUMAN GENOME-DERIVED SINGLE EXON NUCLEIC ACID PROBES 
USEFUL FOR GENE 

; TITLE OF INVENTION: EXPRESSION ANALYSIS TWO 

FILE REFERENCE: AEOMICA-X-2 
; CURRENT APPLICATION NUMBER: US/10/029,386 
; CURRENT FILING DATE: 2001-12-20 
; NUMBER OF SEQ ID NOS : 34288 

; SOFTWARE: Annomax Sequence Listing Engine vers. 1.1 
; SEQ ID NO 26613 

LENGTH: 201 
; TYPE: DNA 
; ORGANISM: Homo sapiens 



FEATURE : 



OTHER 
OTHER 
OTHER 
OTHER 
OTHER 
OTHER 
OTHER 
OTHER 
US-10-029- 



INFORMATION: 
INFORMATION: 
INFORMATION: 
INFORMATION: 
INFORMATION: 
INFORMATION: 
INFORMATION: 
INFORMATION: 
•386-26613 



MAP TO CHR5.3 

EXPRESSED IN HEART, SIGNAL =0.55 
EXPRESSED IN ADULT LIVER, SIGNAL = 0.4 9 
EXPRESSED IN FETAL LIVER, SIGNAL =0.72 
EXPRESSED IN BRAIN, SIGNAL =0.66 
SWISSPROT HIT: 014511, EVALUE 3.00e-29 
NT HIT: AF119152.1, EVALUE l.OOe-109 
EST HUMAN HIT: BF108794.1, EVALUE 3,00e-93 



Query Match 19.3%; Score 173; DB 15; Length 201; 

Best Local Similarity 100.0%; Pred. No, 5.1e-39; 

Matches 173; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 



Db 



42 4 GCCACCCGGCCCAAGTTGAAGAAGATGTiAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 483 
I I I I M I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
27 GCCACCCGGCCCAAGTTGAAG7y\.GATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 86 



Qy 

Db 

Qy 

Db 



4 84 TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 543 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
87 TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 14 6 

544 GGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAG 596 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M 
147 GGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAG 199 



RESULT 11 
US-10-447-839A-11 

; Sequence 11, Application US/10447839A 

; Publication No. US20040018181A1 

; GENERAL INFORMATION: 

; APPLICANT: Kufe, Donald W. 

; APPLICANT: Kharbanda, Surender 

; APPLICANT: Weitman, Steven D. 

; TITLE OF INVENTION: MUCl INTERFERENCE RNA COMPOSITIONS AND METHODS DERIVED 
THEREFROM 

; FILE REFERENCE: 1000.05.009 

; CURRENT APPLICATION NUMBER: US/ 10/ 447 , 839A 

; CURRENT FILING DATE: 2003-05-29 

; PRIOR APPLICATION NUMBER: 10/293,391 

; PRIOR FILING DATE: 2002-11-13 

; PRIOR APPLICATION NUMBER: 09/951,938 

; PRIOR FILING DATE: 2001-09-11 

; PRIOR APPLICATION NUMBER: 60/231,841 

; PRIOR FILING DATE: 2000-09-11 

; NUMBER OF SEQ ID NOS : 109 

; SOFTWT^E: Patentin version 3.2 

; SEQ ID NO 11 

; LENGTH: 419 

TYPE: DNA 
; ORGANISM: Homo Sapien 
US-10-447-839A-11 



Query Match 19.3%; Score 173; DB 16; Length 419; 

Best Local Similarity 100.0%; Pred. No. 6.1e-39; 



Matches 173; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 424 GCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 4 83 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 50 GCCACCCGGCCCAAGTTGAAGTiAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 109 

Qy 4 84 TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 543 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 110 TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 169 

Qy 544 GGCT^GGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAG 596 

I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 170 GGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCTWiLTATGGCAACGGCAG 222 



RESULT 12 

US-10-029-386-12913 

Sequence 12913, Application US/10029386 
Publication No. US20030194704A1 
GENERAL INFORMATION: 
APPLICANT: Penn, Sharron G. 
APPLICANT: Rank, David R. 
APPLICANT: Hanzel, David K. 

TITLE OF INVENTION: HUMAN GENOME-DERIVED SINGLE EXON NUCLEIC ACID PROBES 
USEFUL FOR GENE 

TITLE OF INVENTION: EXPRESSION ANALYSIS TWO 
FILE REFERENCE: AEOMICA-X-2 

CURRENT APPLICATION NUMBER: US/10/029, 386 
CURRENT FILING DATE: 2001-12-20 
NUMBER OF SEQ ID NOS : 34288 

SOFTWARE: Annomax Sequence Listing Engine vers. 1.1 
SEQ ID NO 12913 
LENGTH: 573 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 

OTHER INFORMATION: MAP TO CHR5 . 3 

OTHER INFORMATION: EXPRESSED IN HEART, SIGNAL =0.55 
OTHER INFORMATION: EXPRESSED IN ADULT LIVER, SIGN7VL =0.49 
OTHER INFORMATION: EXPRESSED IN FETAL LIVER, SIGNAL =0.72 
OTHER INFORMATION: EXPRESSED IN BRAIN, SIGNAL =0.66 
OTHER INFORMATION: SWISSPROT HIT: 014511, EVALUE 2.00e-28 
OTHER INFORMATION: NT HIT: AF119152.1, EVALUE O.OOe+OO 
OTHER INFORMATION: EST_HUMAN HIT: BG996653.1, EVALUE l.OOe-108 
US-10-029-386-12913 

Query Match 19.3%; Score 173; DB 15; Length 573; 

Best Local Similarity 100.0%; Pred. No. 6.6e-39; 

Matches 17 3; Conservative 0 ; Mismatches 0 ; Indels 0 ; Gaps 0 ; 

Qy 42 4 GCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 4 83 

I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 377 GCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 4 36 

Qy 4 84 TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 543 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I 
Db 437 TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 496 



Qy 544 GGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAG 596 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I 
Db 4 97 GGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAG 549 



RESULT 13 
US-10-271-416-7 

Sequence 7, Application US/10271416 
Publication No. US2004 0043021A1 
GENERAL INFORMATION: 
APPLICANT: Keith, Tim 
APPLICANT: Little, Randall D. 
APPLICANT: Van Eerdewegh, Paul 
APPLICANT: Dupuis, Josee 
APPLICANT: Del Mastro, Richard G. 
APPLICANT: Allen, Kristina 

TITLE OF INVENTION: NUCLEOTIDE AND AMINO ACID SEQUENCES 
TITLE OF INVENTION: RELATING TO RESPIRATORY DISEASES AND OBESITY 
FILE REFERENCE: 2976-4045 

CURRENT APPLICATION NUMBER: US/10/271, 416 
CURRENT FILING DATE: 2002-10-11 
PRIOR APPLICATION NUMBER: 60/328,424 
PRIOR FILING DATE: 2001-10-11 
NUMBER OF SEQ ID NOS : 9 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 7 

LENGTH: 45450 
TYPE: DNA 

ORGANISM: Homo sapien 
FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: (1) . . . (45450) 
OTHER INFORMATION: n = A,T,C or G 
US-10-271-416-7 

Query Match 19.3%; Score 173; DB 13; Length 45450; 

Best Local Similarity 100.0%; Pred. No. 1.9e-38; 

Matches 173; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 424 GCCACCCGGCCCAAGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 4 83 

I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I M 
Db 702 6 GCCACCCGGCCC7\AGTTGAAGAAGATGAAGAGCCAGACGGGACAGGTGGGTGAGAAGCAA 708 5 

Qy 484 TCGCTGT^AGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 543 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I 
Db 7086 TCGCTGAAGTGTGAGGCAGCAGCCGGTAATCCCCAGCCTTCCTACCGTTGGTTCAAGGAT 7145 

Qy 544 GGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAG 596 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 7146 GGCAAGGAGCTCAACCGCAGCCGAGACATTCGCATCAAATATGGCAACGGCAG 7198 



RESULT 14 
US-10-447-839A-12 

; Sequence 12, Application US/10447839A 
; Publication No. US2004001818 lAl 



GENERAL INFORMATION: 
APPLIC7\NT: Kufe, Donald W. 
APPLICANT: Kharbanda, Surender 
APPLICANT: Weitman, Steven D. 

TITLE OF INVENTION: MUCl INTERFERENCE RNA COMPOSITIONS AND METHODS DERIVED 
THEREFROM 

FILE REFERENCE: 1000.05.009 

CURRENT APPLICATION NUMBER: US/ 10/ 4 4 7 , 83 9A 
CURRENT FILING DATE: 2003-05-29 
PRIOR APPLICATION NUMBER: 10/293,391 
PRIOR FILING DATE: 2002-11-13 
PRIOR APPLICATION NUMBER: 09/951,938 
PRIOR FILING DATE: 2001-09-11 
PRIOR APPLICATION NUMBER: 60/231,841 
PRIOR FILING DATE: 2000-09-11 
NUMBER OF SEQ ID NOS : 10 9 
SOFTWARE: Patentin version 3.2 
SEQ ID NO 12 
LENGTH: 4 93 
TYPE: DNA 

ORGANISM: Homo Sapien 
US-10-447-839A-12 

Query Match 13.9%; Score 124.6; DB 16; Length 493; 

Best Local Similarity 93.5%; Pred. No, 3.5e-25; 

Matches 130; Conservative 0; Mismatches 9; Indels 0; Gaps 0; 

Qy 594 CAGAAAGAACTCACGACTACAGTTCAACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGT 653 

I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 227 CAGAAAG7\ACTCACGACTACAGTTC7\ACAAGGTGAAGGTGGAGGACGCTGGGGAGTATGT 286 

Qy 654 CTGCGAGGCCGAGT^CATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAG 713 

I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 287 CTGCGAGGCCGAGAACATCCTGGGGAAGGACACCGTCCGGGGCCGGCTTTACGTCAACAG 34 6 

Qy 714 CGTGAGCACCACCCTGTCA 732 

M II I I I I I I 

Db 347 CGGTAGGTGGGCCCAGACA 365 



RESULT 15 
US-10-447-839A-13 

; Sequence 13, Application US/10447839A 

; Publication No. US2004001818 lAl 

; GENERAL INFORMATION: 

; APPLICANT: Kufe, Donald W. 

; APPLICANT: Kharbanda, Surender 

; APPLICANT: Weitman, Steven D. 

; TITLE OF INVENTION: MUCl INTERFERENCE RNA COMPOSITIONS AND METHODS DERIVED 
THEREFROM 

; FILE REFERENCE: 1000.05.009 

; CURRENT APPLICATION NUMBER: US/ 10/447 , 8 39A 

; CURRENT FILING DATE: 2003-05-29 

; PRIOR APPLICATION NUMBER: 10/293,391 

; PRIOR FILING DATE: 2002-11-13 

; PRIOR APPLICATION NUMBER: 09/951,938 

; PRIOR FILING DATE: 2001-09-11 



PRIOR APPLICATION NUMBER: 60/231,841 
; PRIOR FILING DATE: 2000-09-11 
; NUMBER OF SEQ ID NOS : 109 
; SOFTWARE: Patentin version 3.2 
; SEQ ID NO 13 

LENGTH: 350 
; TYPE: DNA 

ORGANISM: Homo Sapien 
US-10-447-839A-13 

Query Match 13.8%; Score 124; DB 16; Length 350; 

Best Local Similarity 100.0%; Pred. No. 4.7e-25; 

Matches 124; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 715 GTGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGC7\ACGAGACAGCCAAG 774 




Db 



99 GTGAGCACCACCCTGTCATCCTGGTCGGGGCACGCCCGGAAGTGCAACGAGACAGCCAAG 158 



775 TCCTATTGCGTCAATGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGC 8 34 




Db 



159 TCCTATTGCGTCT^TGGAGGCGTCTGCTACTACATCGAGGGCATCAACCAGCTCTCCTGC 218 



Qy 



835 AAGT 838 



Db 



219 AAGT 222 



Search completed: August 15, 2004, 12:23:39 
Job time : 491.839 sees 



